Info

Open Website

Testing and Issues

You can test this app and submit issues during the testing period of the Data Clustering Contest, Stage 2 contest.

Entries with serious issues will not be able to win the contest, but even minor issues might be important for overall results.

Voting

44
by time

Issues

Как всегда высший класс
2
Only top is working :/
1
Swift Skunk Jun 23, 2020 at 22:11
Right. My code doesn't look for articles in subfolders of the given folder. I hope the judges can rerun it, dumping all articles in the same folder and giving that folder as input.
Fair Leopard Jun 24, 2020 at 12:28
If you have some comments for the judges about running your submission please leave a reply here.
Swift Skunk Jun 24, 2020 at 13:07
Hello! In languages, news, categories, and threads, my program couldn't find the HTML files. It doesn't look for files in subfolders of the given folder. Could you put all HTML files in one folder, and rerun my submission with this folder as source_dir? This way, it will find the files and work correctly. Sorry for my mistake! Thank you!
Fair Leopard Jun 24, 2020 at 23:18
We had to re-run your algorithm with all HTML files in one folder and will apply relevant penalties during the final scoring.
1
Swift Skunk Jun 25, 2020 at 00:10
Thank you!
Fair Leopard Jul 7, 2020 at 16:10
In our preliminary tests, this submission received the following scores (out of 100):

Languages: 66
News EN: 56
News RU: 54
Categories EN: 51
Categories RU: 49
Threads EN: 50
Threads RU: 33
10
Fair Quokka Jul 31, 2020 at 22:06
В ходе тестирования алгоритма были выявлены следующие недостатки в ранжировании:

1. RU
– Отсутствуют многие главные сюжеты в разделе ‘Main’ и внутри категорий.
– Заголовки некоторых сюжетов не отражают их содержание.
– Некоторые главные сюжеты нерелевантны для широкой аудитории из России.

2. EN
– Отсутствуют некоторые главные сюжеты в разделе ‘Main’ и внутри категорий.
– Заголовки некоторых сюжетов не отражают их содержание.
– Нарушена сортировка статей в некоторых сюжетах: нерелевантные статьи отображаются выше релевантных.
– Часть главных сюжетов нерелевантны для широкой англоязычной аудитории.
20
Nobody added any issues yet...