Testing and Issues

You can test this app and submit issues during the testing period of the Data Clustering Contest, Stage 2.

Entries with serious issues will not be able to win the contest, but even minor issues might be important for overall results.


Fair Leopard Jul 7 at 16:10
In our preliminary tests, this submission received the following scores (out of 100):

Languages: 100
News EN: 81
News RU: 81
Categories EN: 48
Categories RU: 51
Threads EN: 0
Threads RU: 0
Kingly Hedgehog Jul 7 at 17:47
The scores of the News part seem strange.
As you can see in the comments below, there was a problem running the News part, which has since been solved. So I'm guessing the scores may have been calculated from the old outputs.
Would it be possible for you to double-check that?
Thank you
Fair Leopard Jun 23 at 20:52
We had to fix the following issues before running the algorithm and will apply relevant penalties during the final scoring:
- no tgnews binary in the root folder;
- 8 missing modules: iso8601, numpy, fasttext, gensim, torch, tqdm, hdbscan, flask.
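
A check like the following could catch missing modules before submission. This is only a minimal sketch: the helper name find_missing_modules is hypothetical, and it assumes each module's import name matches the name in the list above.

```python
import importlib.util

# Module names taken from the issue report above.
REQUIRED_MODULES = [
    "iso8601", "numpy", "fasttext", "gensim",
    "torch", "tqdm", "hdbscan", "flask",
]


def find_missing_modules(names):
    """Return the names that cannot be located by the import system.

    Hypothetical helper: it only looks modules up, it does not import them,
    so a module that is present but broken will still count as found.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", " ".join(missing))
    else:
        print("All required modules were found.")
```

Running it in the target environment prints the modules that still need to be installed, if any.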
Kingly Hedgehog Jun 24 at 17:41
Hello. I hope you are doing well. There is a mismatch between the scikit-learn version installed on your side and the one I used. The correct version for running the news part is 0.22.2.post1.

To fix the problem, you should run these commands:

- pip uninstall scikit-learn
- pip install scikit-learn==0.22.2.post1

It is better to run the first command twice. 
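
After reinstalling, the installed version could be confirmed with a short script such as the one below. It is a sketch rather than part of the submission: the helper names installed_version and matches_pin are hypothetical, and it assumes Python 3.8 or newer, where importlib.metadata is in the standard library.

```python
from importlib import metadata

# Version quoted in this comment as the one needed for the news part.
EXPECTED_VERSION = "0.22.2.post1"


def installed_version(dist_name):
    """Return the installed version string, or None if it is not installed."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def matches_pin(dist_name, expected):
    """True only when the distribution is installed at exactly `expected`."""
    return installed_version(dist_name) == expected


if __name__ == "__main__":
    found = installed_version("scikit-learn")
    if matches_pin("scikit-learn", EXPECTED_VERSION):
        print("scikit-learn version is correct:", found)
    else:
        print("Expected", EXPECTED_VERSION, "but found", found)
```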

There is a solution for running the threading part as well, but putting both comments in a single reply exceeds the text size limit. It is not possible to reply twice to your message or to add any other text, so I am attaching it as an image.

Fair Leopard Jul 3 at 00:31
We re-ran your algorithm with scikit-learn==0.22.2.post1 installed. The source code cannot be changed after the deadline.
Fair Leopard Jul 7 at 18:07
#issue11310 You are right. Your scores have been updated.
Kingly Hedgehog Jul 7 at 18:12
Thank you.
