Open Website

Testing and Issues

You can test this app and submit issues during the testing period of the Data Clustering Contest contest.

Entries with serious issues will not be able to win the contest, but even minor issues might be important for overall results.


by rating


Fair Leopard Feb 28, 2020 at 15:11
Final score for this submission (out of 100):

Languages: 13.29
News EN: 13.26
News RU: 13.25
Categories EN: 25.23
Categories RU: 13.25
Threads EN: 23.3
Threads RU: 13.08
Top news EN: 10.49
Top news RU: 10.66

These data reflect the relative accuracy, precision and speed of the algorithm as compared to the other submissions.
Fair Leopard Feb 6, 2020 at 16:03
In our preliminary tests, this submission received the following scores (out of 100):

Languages: 78
News EN: 87
News RU: 34
Categories EN: 53
Categories RU: 6
Threads EN: 58
Threads RU: 19

Unfortunately, this submission didn't get a high enough score for the final task (top news) to be evaluated.

This is not the final result, please stay tuned for updates. We apologize for the delay.
Fair Leopard Dec 12, 2019 at 15:58
We had to fix the following issues before running the algorithm and will apply relevant penalties during the final scoring:
- invalid languages output format, fixed "language" => "lang_code"
- invalid top output format, fixed uppercased categories
Poor clustering for Russian. Moreover, it seems that clusters are ranked using cluster size only.
Gentle Cockroach Dec 12, 2019 at 19:01
I am sorry. I am not a native speaker of that language. It was my first time using Russian in a project and I used it hastily.
Samsung and Black Friday rule the Science top
Gentle Cockroach Dec 13, 2019 at 06:54
I understand the issue. Thanks for report
Looks like you have some disbalance in russian labels, but I like top threads in your solution, so take my thumb up
Gentle Cockroach Dec 12, 2019 at 20:48
I'll try to learn more Russian. Thanks
Nobody added any issues yet...