FastDup Software, (C) copyright 2022 Dr. Amir Alush and Dr. Danny Bickson.
########################################################################################
Dataset Analysis Summary:
Dataset contains 7384 images
Valid images are 100.00% (7,384) of the data, invalid are 0.00% (0) of the data
Similarity: 5.12% (378) belong to 11 similarity clusters (components).
94.88% (7,006) images do not belong to any similarity cluster.
Largest cluster has 24 (0.33%) images.
For a detailed analysis, use `.connected_components()`
(similarity threshold used is 0.8, connected component threshold used is 0.9).
Outliers: 6.13% (453) of images are possible outliers, and fall in the bottom 5.00% of similarity values.
For a detailed list of outliers, use `.outliers(data=True)`.