In a technology-forward world, sometimes the best and easiest tools are still pen and paper. Organic chemists frequently draw out molecular work with the Skeletal formula, a structural notation used for centuries. Recent publications are also annotated with machine-readable chemical descriptions (InChI), but there are decades of scanned documents that can't be automatically searched for specific chemical depictions. 

Automated recognition of optical chemical structures, with the help of machine learning, could speed up research and development .

Our talk will walk through the various high level aspects of our 7th place Gold medal winning approach on Kaggle.

Specifically, we will present our model architectures, loss functions, ensemble strategy and post-processing methods.

Local ODSC chapter in Kolkata, India

Instructor's Bio

Rajneesh Tiwari

 Senior Data Scientist/Associate Director at Novartis | Kaggle Competitions Master

Rajneesh works as a Data Scientist II at Novartis and has 10+ years of ML experience across varied domains such as Telecom, Retail, Pharma, etc. He loves competing on Kaggle and is current ranked top1% within Kaggle’s competitions tier.

 Tanul Singh

NLP Engineer at Jarvis | Kaggle Notebooks Grandmaster&Kaggle Competitions Master

Tanul Singh works as an NLP Engineer at Javis. Tanul loves NLP, and works to build cutting edge NLP based solutions. He is a Kaggle Competitions Master and a Kaggle Notebooks Grandmaster (Ranked 9 Globally)

 Shivam Gupta

 Data and Applied Scientist at Microsoft | Kaggle Competitions Master

Shivam works as Data and Applied Scientist at Microsoft. He is working on selection and relevance tasks using DNN models on sponsored search ads data. He completed his master’s degree from IIT Kharagpur in Computer Science and Data Processing. He is also Kaggle Competitions Master with Global Rank of 303.

Nischay Dhankhar 

Student at NSUT Delhi | Kaggle Competitions Master

Nischay is currently a second year Electronics and Communication Engineering student at NSUT, Delhi. He is also a Kaggle competitions master, participated in several competitions within the last two years and is globally ranked under top 50. He has passion for ML modelling specifically in tabular data, computer vision and ensemble learning.


  • 1

    A Kaggle winning approach for converting chemical molecular images to Inchi string representations via deep learning methods

    • Ai+ Training

    • Webinar recording

    • Join ODSC APAC 2021 Training Conference