Info

Testing and Issues

You can test this app and submit issues during the testing period of the Data Clustering Contest.

Entries with serious issues will not be able to win the contest, but even minor issues might be important for overall results.

Voting

9

Comments

Timeout on 35k articles. Looks like I should have searched for similar articles within categories; that could be much faster.
Unfortunately, I did not expect a single-language cleaned dataset (en_source_dir, ru_source_dir), which led to errors in 'threads' and 'top'. It's a simple fix, but not allowed by the rules. Could you please rerun against the raw dataset (raw_source_dir) instead? That should fit within the time limit.
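
The idea of searching for similar articles only within categories can be sketched roughly as below. This is a minimal illustration, not the submission's actual code: the function names, the `(article_id, category, text)` tuple layout, the Jaccard token overlap, and the 0.5 threshold are all assumptions made for the example.

```python
from collections import defaultdict
from itertools import combinations


def jaccard(a: set, b: set) -> float:
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def similar_pairs_within_categories(articles, threshold=0.5):
    """Return pairs of article ids whose token overlap meets the threshold.

    `articles` is a list of (article_id, category, text) tuples
    (a hypothetical layout). Only articles that share a category
    are ever compared with each other.
    """
    # Bucket articles by category, tokenizing each text once.
    buckets = defaultdict(list)
    for article_id, category, text in articles:
        buckets[category].append((article_id, set(text.lower().split())))

    # Pairwise comparison happens inside each bucket only.
    pairs = []
    for items in buckets.values():
        for (id_a, tok_a), (id_b, tok_b) in combinations(items, 2):
            if jaccard(tok_a, tok_b) >= threshold:
                pairs.append((id_a, id_b))
    return pairs


articles = [
    ("a1", "sports", "team wins cup final"),
    ("a2", "sports", "team wins cup final again"),
    ("a3", "economy", "team wins cup final"),
    ("a4", "economy", "central bank raises rates"),
]
print(similar_pairs_within_categories(articles))  # [('a1', 'a2')]
```

With n articles split into k roughly equal categories, the number of comparisons drops from about n²/2 to about n²/(2k). The trade-off is visible in the sample data: `a3` has the same text as `a1` but sits in another category, so that pair is never checked.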

Issues

Fair Leopard Feb 28, 2020 at 15:11
Final score for this submission (out of 100):

Languages: 12.77
News EN: 12.59
News RU: 12.73
Categories EN: 12.46
Categories RU: 12.61
Threads EN: 0
Threads RU: 0
Unfortunately, this submission didn't get a high enough score to be evaluated for Top news (task 5).

These data reflect the relative accuracy, precision and speed of the algorithm as compared to the other submissions.
30
Fair Leopard Feb 6, 2020 at 16:03
In our preliminary tests, this submission received the following scores (out of 100):

Languages: 95
News EN: 38
News RU: 43
Categories EN: 19
Categories RU: 11
Threads EN: 0
Threads RU: 0

Unfortunately, this submission didn't get a high enough score for the final task (top news) to be evaluated.

This is not the final result, please stay tuned for updates. We apologize for the delay.
20
Fair Leopard Dec 15, 2019 at 14:03
#comment9980
We had to re-run your algorithm with extra articles but it had no effect.
10
Noble Squirrel Dec 15, 2019 at 14:34
Yep, anyway, it's still too slow to fit. ~10 minutes on a four-core 2.66 GHz i7 for 4.8k English articles, and much longer for 7.8k Russian ones. Thank you, Leopard.