• search hit 3 of 3
Back to Result List

Advancements in Hand-Drawn Chemical Structure Recognition through an Enhanced DECIMER Architecture

  • Accurate recognition of hand-drawn chemical structures is crucial for digitising hand-written chemical information found in traditional laboratory notebooks or for facilitating stylus-based structure entry on tablets or smartphones. However, the inherent variability in hand-drawn structures poses challenges for existing Optical Chemical Structure Recognition (OCSR) software. To address this, we present an enhanced Deep lEarning for Chemical ImagE Recognition (DECIMER) architecture that leverages a combination of Convolutional Neural Networks (CNNs) and Transformers to improve the recognition of hand-drawn chemical structures. The model incorporates an EfficientNetV2 CNN encoder that extracts features from hand-drawn images, followed by a Transformer decoder that converts the extracted features into Simplified Molecular Input Line Entry System (SMILES) strings. Our models were trained using synthetic hand-drawn images generated by RanDepict, a tool for depicting chemical structures with different style elements. To evaluate the model's performance, a benchmark was performed using a real-world dataset of hand-drawn chemical structures. The results indicate that our improved DECIMER architecture exhibits a significantly enhanced recognition accuracy compared to other approaches.

Export metadata

Metadaten
Author:Kohulan Rajan, Henning Otto Brinkhaus, Achim Zielesny, Christoph Steinbeck
DOI:https://doi.org/10.26434/chemrxiv-2024-7ch9f
Document Type:Preprint
Language:English
Date of Publication (online):2024/07/15
Year of first Publication:2024
Publishing Institution:Westfälische Hochschule Gelsenkirchen Bocholt Recklinghausen
Release Date:2024/07/23
Tag:DECIMER; Deep Learning; OCSR, Optical Chemical Structure Recognition; Transformer
Departments / faculties:Institute / Institut für biologische und chemische Informatik
Licence (German):License LogoEs gilt das Urheberrechtsgesetz

$Rev: 13159 $