For this challenge, we have teamed up with Battelle Memorial Institute - one of the most respected names in the global scientific & research community - to launch a Data Science competition that can help to dramatically accelerate the pace of global innovation. The goal of this project is to break down several barriers that currently stand in the way of advanced research publications getting noticed, and receiving prompt recognition from the world's brightest minds. This competition will also offer cash prizes to the authors of the top two ML models, as determined by our platform's evaluation algorithm. Please read on for more details, and good luck!
Battelle is solving the world’s most pressing challenges. We deliver when others can’t. We conduct research and development, manage laboratories, design and manufacture products, and deliver critical services for our clients – whether you are a multi-national corporation, a small start-up organization or a government agency. We are valued for our independence and ability to innovate.
We are part of a community working to encourage the discovery of new and interesting research in Artificial Intelligence and Machine Learning, especially in languages other than English. Much of the research being done in these fields is easily available on the web through sites like Arxiv.org, but many interesting discoveries are happening every day in different corners of the internet that may take time to identify and bring to the attention of the rest of the community.
This is especially true of research that is in a language other than English, which may easily missed by much of the community. We are passionate about finding the best current research, and identifying trends so that the cutting edge can continue to be pushed. In order to push in that direction, we devised a problem that attempts to measure when new ideas are being discussed, in any language. Based on a metric for recency of key words, how can we identify when a research paper is bringing forth new ideas so that we can better isolate them?
The data is a collection of 42.912 abstracts from recent publications, along with the language and year of publication. The abstracts have author given keywords associated with them, and they have been given scores based on the average number of years that those keywords show up in our database. The goal of this competition is to build a model that is able to take in the abstract, the language, and the publication year, and predict the recency score. These models will be scored based on the accuracy of their predictions.
This competition has a total duration of 2 months, within which you will be able to make your submissions and obtain results automatically. Once the first part of the competition is over, you will have one week to choose your best model and submit it to be scored and considered for the cash prize.
Share this competition:
For questions, Queries, Feedbacks & Suggestions, Contact:
Keep up to date by participating in our global community of data scientists and AI enthusiasts. We discuss the latest developments in data science competitions, new techniques for solving complex challenges, AI and machine learning models, and much more!