I bet that +90% of these notes are from spammy accounts. If you want I can give you access to
@Vertex for free. You can use its API to rank profiles, and if the rank of the event pubkey is lower than a threshold, you just remove it.
I can help you find suitable thresholds if needed