Testing and Issues

You can test this app and submit issues during the testing period of the Data Clustering Contest.

Entries with serious issues will not be able to win the contest, but even minor issues might be important for overall results.

Issues

Fair Leopard Dec 12, 2019 at 14:57
The following issues have been discovered during preliminary testing:
- FileNotFoundError: [Errno 2] No such file or directory: '../src/utils/stopwords.json'
Fair Leopard Dec 13, 2019 at 14:21
#issue9758
We ran your binary from the root of your submission directory. The binary file and src folder are on the same level. So the correct path to stopwords.json from submission folder is './src/utils/stopwords.json'
Fair Leopard Dec 15, 2019 at 21:50
#issue9954
Where is src folder in your example? Can you show the output of the commands `ls -l ~/tgnews` and `ls -l ~/tgnews/submission`?
Huge Flamingo Dec 15, 2019 at 23:08
this really is an oversight on my part. i did have another src folder in the same folder containing the submission.🙈
Fair Leopard Dec 15, 2019 at 23:30
#issue10009
We had to re-run your algorithm from subfolder with extra articles and will apply relevant penalties during the final scoring.
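
The `stopwords.json` failure above comes from a relative path being resolved against the current working directory rather than against the submission itself. A minimal sketch of one way to avoid that, assuming the entry point is a Python script that sits next to the `src` folder; `resolve_resource` and `load_stopwords` are hypothetical helper names, not part of the submission:

```python
import json
from pathlib import Path


def resolve_resource(relative, base_file):
    """Resolve `relative` against the directory that contains `base_file`.

    The result does not depend on the directory the program is launched from.
    """
    return Path(base_file).resolve().parent / relative


def load_stopwords(path):
    """Read a JSON stop-word list from `path`."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


# Hypothetical usage inside the entry-point script:
#   stopwords = load_stopwords(
#       resolve_resource("src/utils/stopwords.json", __file__)
#   )
```

For a binary built from Python sources, `__file__` may not point at the submission folder; in that case the base would have to come from something like `sys.argv[0]`, which is likewise an assumption about how the binary was packaged.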
Almost no news/non-news filtering
Very short threads. Weak merging of articles on the same topic
Getahun Mesele Jan 18, 2020 at 19:21
There are miscategorization issues. How can reports about a South African beauty contest appear in the science category?
Apple/safari/12.4
Fair Leopard Feb 6, 2020 at 16:03
In our preliminary tests, this submission received the following scores (out of 100):

Languages: 99
News EN: 87
News RU: 93
Categories EN: 57
Categories RU: 78
Threads EN: 69
Threads RU: 40
Top EN: 48
Top RU: 44

This is not the final result, please stay tuned for updates. We apologize for the delay.
Fair Mammoth Feb 7, 2020 at 16:09
Preliminary testing of the algorithm revealed the following shortcomings in ranking:

– Some important threads are missing from the 'Main' section; not all important threads are represented in the 'Society', 'Economy', and 'Entertainment' categories.

– Some thread titles are cut off. The titles of some threads are too vague (the information is not presented in a concise, neutral form).

– Article sorting is broken in some threads: relevant articles are mixed with irrelevant ones.
Fair Leopard Feb 28, 2020 at 15:11
Final score for this submission (out of 100):

Languages: 64.08
News EN: 0
News RU: 13.07
Categories EN: 31.38
Categories RU: 67.19
Threads EN: 35.4
Threads RU: 25.93
Top news EN: 10.28
Top news RU: 10.41

These data reflect the relative accuracy, precision and speed of the algorithm as compared to the other submissions.