jina-ai / embedding-fingerprints
Identify which embedding model produced a vector using digit-level tokenization and a tiny transformer
A research tool that trains a small neural network to identify which text embedding model generated a given vector by analyzing patterns in its numerical values.
How It Works
You start with a tool that identifies which embedding model turned a piece of text into a given vector of numbers.
You create a simple list of everyday sentences, such as quotes or questions, to use as input text.
You embed those sentences with many different embedding models and save each model's vectors as labeled training examples.
You run a short training session in which a tiny transformer learns to tell the models apart from patterns in the digits of their values.
You check charts that show how accurately it is learning to recognize each model.
You can then pass in an unknown vector and get a prediction of which model produced it.
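The data-preparation steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the repository's actual code: the vocabulary, the fixed four-decimal formatting, the `max_values` cutoff, and the names `tokenize_vector`, `detokenize`, and `build_examples` are all hypothetical. It shows one plausible reading of "digit-level tokenization": each float is rendered as text and every character (digit, sign, decimal point, separator) becomes one token id, paired with a label for the model that produced the vector.

```python
# Hypothetical character vocabulary: ten digits, minus sign, decimal point, separator.
VOCAB = {ch: i for i, ch in enumerate("0123456789-. ")}
INV_VOCAB = {i: ch for ch, i in VOCAB.items()}


def tokenize_vector(values, decimals=4, max_values=8):
    """Render the first `max_values` floats as fixed-precision text, one token id per character."""
    text = " ".join(f"{v:.{decimals}f}" for v in values[:max_values])
    return [VOCAB[ch] for ch in text]


def detokenize(ids):
    """Map token ids back to the character string they encode."""
    return "".join(INV_VOCAB[i] for i in ids)


def build_examples(vectors_by_model):
    """Turn {model_name: [vector, ...]} into (token_ids, label_id) training pairs.

    Labels are assigned in sorted order of the model names.
    """
    labels = sorted(vectors_by_model)
    examples = []
    for label_id, name in enumerate(labels):
        for vec in vectors_by_model[name]:
            examples.append((tokenize_vector(vec), label_id))
    return labels, examples


if __name__ == "__main__":
    # Placeholder model names and made-up values, for illustration only.
    data = {
        "model-a": [[0.0123, -0.5, 1.0]],
        "model-b": [[0.25, 0.75, -0.125]],
    }
    labels, examples = build_examples(data)
    for ids, label_id in examples:
        print(labels[label_id], detokenize(ids), ids)
```

The resulting token sequences and labels are the kind of input a small sequence classifier, such as the tiny transformer the project describes, could be trained on. The sketch does not handle non-finite values such as NaN or infinity.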