

Advancing Non-Animal Testing: New Software Reveals Gaps in Chemical Risk Models

A white mouse climbing over test tubes.
Credit: iStock.

Summary

University of Vienna researchers developed MolCompass, a tool that identifies blind spots in machine learning models used for chemical risk assessment. By mapping chemical compounds and highlighting areas where predictions fail, the tool aims to increase transparency and confidence in computational models, advancing non-animal testing methods in toxicology.

Key Takeaways

  • MolCompass helps identify areas in chemical space where machine learning models for toxicology may make incorrect predictions with high confidence.
  • The tool uses interactive maps to visualize and explore chemical compounds, revealing regions where models are unreliable.
  • This approach enhances the transparency and reliability of computational risk assessment, supporting non-animal testing alternatives.

In recent years, machine learning models have become increasingly popular for risk assessment of chemical compounds. However, they are often considered 'black boxes' due to their lack of transparency, leading to scepticism among toxicologists and regulatory authorities. To increase confidence in these models, researchers at the University of Vienna proposed to carefully identify the areas of chemical space where these models are weak. They developed an innovative software tool ('MolCompass') for this purpose and the results of this research approach have just been published in the prestigious Journal of Cheminformatics.


    For many years, new pharmaceuticals and cosmetics have been tested on animals. These tests are expensive, raise ethical concerns, and often fail to accurately predict human reactions. Recently, the European Union supported the RISK-HUNT3R project to develop the next generation of non-animal risk assessment methods. The University of Vienna is a member of the project consortium. Computational methods now allow the toxicological and environmental risks of new chemicals to be assessed entirely by computer, without the need to synthesize the chemical compounds. But one question remains: How confident are these computer models?


    It's all about reliable prediction

    To address this issue, Sergey Sosnin, a senior scientist in the Pharmacoinformatics Research Group at the University of Vienna, focused on binary classification. In this context, a machine learning model provides a probability score from 0% to 100%, indicating how likely a chemical compound is to be active (e.g., toxic rather than non-toxic, bioaccumulative rather than non-bioaccumulative, or a binder rather than a non-binder to a specific human protein). This probability reflects the confidence of the model in its prediction. Ideally, the model should be confident only in its correct predictions. If the model is uncertain, giving a confidence score around 51%, its prediction can be disregarded in favor of alternative methods. A challenge arises, however, when the model is fully confident in an incorrect prediction.
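
The distinction above, between uncertain predictions, correct ones, and confidently wrong ones, can be sketched in a few lines of Python. This is only an illustration, not MolCompass code: the `Prediction` record, the `triage` function, and both threshold values are assumptions chosen for the example.

```python
from dataclasses import dataclass


@dataclass
class Prediction:
    compound: str    # any identifier, e.g. a SMILES string (hypothetical field)
    p_active: float  # model probability that the compound is active, 0.0 to 1.0
    is_active: bool  # experimentally known label


def triage(pred, uncertain_band=0.10, confident_level=0.90):
    """Sort one prediction into a bucket. Thresholds are illustrative assumptions."""
    predicted_active = pred.p_active >= 0.5
    # Confidence in the predicted class, whichever class that is.
    confidence = max(pred.p_active, 1.0 - pred.p_active)

    if confidence < 0.5 + uncertain_band:
        return "uncertain"        # close to 50%: defer to other methods
    if predicted_active == pred.is_active:
        return "correct"
    if confidence >= confident_level:
        return "confident_error"  # the "nightmare scenario"
    return "error"
```

A prediction with `p_active=0.51` lands in the `"uncertain"` bucket, while a truly active compound scored at `p_active=0.01` is a `"confident_error"`: the model is 99% sure of the wrong answer.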


    "This is the real nightmare scenario for a computational toxicologist," says Sergey Sosnin. "If a model predicts that a compound is non-toxic with 99% confidence, but the compound is actually toxic, there is no way to know that something was wrong." The only solution is to identify in advance the areas of ‘chemical space’ (the set of possible classes of organic compounds) where the model has ‘blind spots’, and to avoid them. To do this, a researcher evaluating the model must check the predicted results for thousands of chemical compounds one by one, a tedious and error-prone task.

    Overcoming this significant hurdle

    "To assist these researchers," Sosnin continues, "we developed interactive graphical tools that display chemical compounds onto a 2D plane, like geographical maps. Using colors, we highlight the compounds that were predicted incorrectly with high confidence, allowing users to identify them as clusters of red dots. The map is interactive, enabling users to investigate the chemical space and explore regions of concern."


    The methodology was demonstrated using an estrogen receptor binding model. Visual analysis of the chemical space made it clear that the model works well for steroids and polychlorinated biphenyls, for example, but fails completely for small non-cyclic compounds and should not be used for them.
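
MolCompass finds such regions visually. A rough numerical analogue, assuming each compound already carries a chemical-class label, is to compute the share of high-confidence errors per class. The function name, the record layout, and the 0.90 threshold below are assumptions for this sketch, and the class names in the usage note are placeholders rather than results from the study.

```python
from collections import defaultdict


def confident_error_rate_by_class(records, confident_level=0.90):
    """Fraction of high-confidence wrong predictions in each chemical class.

    records: iterable of (chemical_class, p_active, is_active) tuples.
    The layout and threshold are illustrative assumptions.
    """
    totals = defaultdict(int)
    errors = defaultdict(int)
    for chem_class, p_active, is_active in records:
        totals[chem_class] += 1
        predicted_active = p_active >= 0.5
        confidence = max(p_active, 1.0 - p_active)
        if predicted_active != is_active and confidence >= confident_level:
            errors[chem_class] += 1
    return {c: errors[c] / totals[c] for c in totals}


# Made-up toy records, not data from the paper.
toy_records = [
    ("class_A", 0.97, True),
    ("class_A", 0.03, False),
    ("class_B", 0.98, False),
    ("class_B", 0.02, True),
]
rates = confident_error_rate_by_class(toy_records)
```

With these toy records, `class_A` has a rate of 0.0 and `class_B` a rate of 1.0. A class with a high rate would be a candidate blind spot where the model should not be trusted.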


    The software developed in this project is freely available to the community on GitHub. Sergey Sosnin hopes that MolCompass will lead chemists and toxicologists to a better understanding of the limitations of computational models. This study is a step toward a future where animal testing is no longer necessary and the only workplace for a toxicologist is a computer desk.


    Reference: Sosnin S. MolCompass: multi-tool for the navigation in chemical space and visual validation of QSAR/QSPR models. J Cheminform. 2024;16(1):98. doi: 10.1186/s13321-024-00888-z

