Could this be useful?…
Could this be useful?
I collect events from all relays for the last hour, group events w/ common words/ngrams, find clusters of >100 events.
This API prints the stop-words for big clusters - if event contains all of words, it's most likely spam. Relays/clients could proactively match new events against these words, or periodically delete specific events/pubkeys.
Was playing with this today, will be using in my relay. It's updated close to real-time.
Also