Testing and Issues

You can test this app and submit issues during the testing period of the Data Clustering Contest, Stage 2.

Entries with serious issues will not be able to win the contest, but even minor issues might be important for overall results.

Issues

Fair Leopard Jul 7, 2020 at 16:10
In our preliminary tests, this submission received the following scores (out of 100):

Languages: 33
News EN: 59
News RU: 83
Categories EN: 61
Categories RU: 81
Threads EN: 74
Threads RU: 63
Bright Deer Jul 7, 2020 at 16:20
thanks, looks good. would you mind re-running `top` according to my previous comment?
Fair Leopard Jun 24, 2020 at 12:28
If you have any comments for the judges about running your submission, please leave a reply here.
Bright Deer Jun 24, 2020 at 12:51
I just don't understand why "top" timed out: when I tested with your scripts, I didn't have any problems, even on a machine with lower specs.
Fair Leopard Jun 24, 2020 at 22:29
There is no timeout. Your server processes requests too slowly: the first 1100 articles were indexed in 69.87 seconds, which is slower than the 1000 requests per 60 seconds described in the rules.
Note that we send simultaneous PUT requests (but no more than 100 at a time).
Bright Deer Jun 27, 2020 at 14:43
We ran a few tests (2020-05-20): our rps is usually above 20, which is more than 1200 articles / 60 seconds. Would you be able to run "Top" once again? The problem might be that certain HTML files are quite large, which we overlooked inside the tgnews app. Would it be possible to cut the HTML contents down to the first 10 paragraphs? Alternatively, I can change only ONE line of my code and re-submit, if that's fine.
Fair Leopard Jul 7, 2020 at 17:44
Did you test how your server processes simultaneous requests (up to 100 at the same time)?
Bright Deer Jul 8, 2020 at 05:42
Yes, we did. I posted some results in the post above. Those are execution times for requests sent in parallel: 40-60 rps -> 2400-3600 files / 60 s. I guess the problem might be that some HTML files are too large: can you set a limit like 10 paragraphs or 2500 characters?
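For anyone who wants to reproduce this kind of measurement, here is a minimal sketch of a parallel load test that keeps at most 100 requests in flight, as the judges describe. It is not the judges' script: the URL layout, the `Content-Type` header, and the names `put_article` and `measure_rps` are all assumptions made for illustration.

```python
import time
import urllib.request
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, Tuple

Article = Tuple[str, bytes]  # (file name, HTML body)


def put_article(base_url: str, name: str, body: bytes) -> int:
    """Send one PUT request. The URL layout and header are assumptions."""
    req = urllib.request.Request(
        f"{base_url}/{name}",
        data=body,
        method="PUT",
        headers={"Content-Type": "text/html"},
    )
    with urllib.request.urlopen(req, timeout=30) as resp:
        return resp.status


def measure_rps(
    articles: Iterable[Article],
    send: Callable[[str, bytes], int],
    max_in_flight: int = 100,
) -> float:
    """Send all articles with at most max_in_flight concurrent requests
    and return the overall rate in requests per second."""
    items = list(articles)
    if not items:
        return 0.0
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=max_in_flight) as pool:
        # Consuming the results makes any request error surface here.
        list(pool.map(lambda item: send(*item), items))
    elapsed = time.perf_counter() - start
    return len(items) / elapsed


# Hypothetical usage against a locally running server:
#   files = [("1.html", b"<html>...</html>")]
#   rps = measure_rps(files, lambda n, b: put_article("http://localhost:8080", n, b))
#   print(f"{rps:.1f} rps -> {rps * 60:.0f} files / 60 s")
```

Measuring the whole batch, rather than averaging per-request times, is what makes the result comparable with a figure such as "1100 articles in 69.87 seconds": a few very large HTML files can hold worker slots and lower the overall rate even when most requests are fast.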