How to run Find Similar¶
What can I do with Find Similar?
- Group visually similar detections to reduce labeling effort.
- Propagate known labels to unknown detections.
- Prepare embeddings for semi-supervised LDA labeling.
- Visually inspect and correct clusters via FiftyOne's UMAP brain view.
1. Confirm prerequisites¶
- Run
Unified Detectionfirst; each detection must contain aleip_embedding.
Inspect detections

2. Launch and Execute the Find Similar operator¶
- Select the
Find Similaroperator in theBrowse Operationsmenu and clickExecute.
Execute Find Similar

Default clustering behavior
The default (and recommended) clustering method combines k-means clustering and Linear Discriminant Analysis (LDA). k-means clustering performs unsupervised grouping of detection embeddings into clusters of visually similar objects; LDA trains a lightweight classifier to extend labels to unlabeled detections. To run k-means clustering only, select that option under Show Advanced Options.
3. Inspect embedding visualization¶
- Open the embedding visualization by selecting
brain_key='find_similar_clusters'.
Open embedding visualization

- Confirm clusters are coherent (similar objects grouped together).
Confirm cluster coherency

- Check unknowns: many should now have labels from majority voting or LDA.
Tip
After modifying a few labels manually, re-run Find Similar to allow LDA to unlock and propagate corrections automatically.
Troubleshooting¶
If you run into problems, consult the troubleshooting guide.