Skip to content

Cambridge Cheminformatics Newsletter, April/May 2023

Dear All,

I would like to circulate some current Cheminformatics- (and related) news to everyone as follows. If you have information from your side for distribution please just let me know, and I am happy to include it on the next occasion!

So here we go…

Events

7 June 2023
Cambridge Cheminformatics Meeting
Cambridge, UK and Online (Hybrid Mode)
http://www.c-inf.net

In person at the Cambridge Crystallographic Data Centre on Union Road
Registration (for Zoom attendance): https://zoom.us/meeting/register/tJcsceuqrzsiHtWRLbOsYTouSI00uGYzq81B

Programme

Structure-based Drug Design with Equivariant Diffusion Models
Charlie Harris, University of Cambridge

DECIMER: Deep Learning for Scraping, Curating and Registering Compounds From the Primary Literature
Kohulan Rajan, Jena University

Distributed HPC Workflows with Covalent
Will Cunningham, Agnostiq

15 June 2023
Berlin Modeller Stammtisch
Berlin, Germany
https://www.eventbrite.com/e/berlin-modeler-stammtisch-tickets-442932221197

19 June 2023
Cambridge Alliance on Medicines Safety (CAMS) Event: Translation of Safety in the Clinic
Virtual Event
https://www.eventbrite.co.uk/e/cambridge-alliance-of-medicines-safety-translation-of-safety-in-the-clinic-tickets-643084070767

26/27 June 2023
Industry Symposium on “AI in the Life Sciences”
Bonn, Germany and Remote
https://www.scai.fraunhofer.de/en/events/industry-symposium-ai-life-sciences.html

4/5 September 2023
6th Artificial Intelligence in Chemistry Symposium
Cambridge, UK
https://www.rscbmcs.org/events/aichem23

6 September 2023 – hold the date!
Cambridge Cheminformatics Meeting – Stay in Town after the AI in Chemistry Symposium!

19-22 September 2023
Summer School: Artificial Intelligence for Medicinal Chemistry and Drug Design
Muenster, Germany
https://www.uni-muenster.de/Chemie.pz/forschen/ag/koch/ai4medchem.html

Job Seekers

Laboratory digitalisation expert with sound knowledge in topics like FAIR Data management, Digital transformation and driving data driven decisions and culture. Main skills include project management, people management, software implementation life cycle, and laboratory automation. Notable knowledge of Pharma, Biotech and Drug discovery processes.

Please contact me and I will bring you in touch (andreas@drugdiscovery.net).

Vacancies

Tenure track position on AI and multiscale modeling in biology and bio-medicine
University of Montpellier
Montpellier, France
https://systems-biology-lphi.cnrs.fr/cpj/

Cheminformatics Senior Scientist, ADME
Atomwise
San Francisco Bay Area or Remote
https://www.atomwise.com/positions/?gh_jid=6682827002

Cheminformatician
Richter Gedeon
Budapest, Hungary
https://www.linkedin.com/jobs/view/3616347683

Principal Scientist Computational Chemistry
Cancer Research UK (CRUK)
Glasgow, Scotland
https://www.linkedin.com/jobs/view/3594554726

Computational Chemist
SoseiHeptares
Cambridge, UK
https://cezanneondemand.intervieweb.it/heptares/jobs/senior-scientist-i-computational-chemistcheminformatics-32302/en

Director Computational Molecular Design; Machine Learning Scientists
Bayer
Berlin, Germany; Cambridge/MA
https://www.linkedin.com/jobs/view/3609526707
https://jobs.bayer.com/job/Cambridge-Machine-Learning-Scientist-II-Mass/939666301
https://jobs.bayer.com/job/Cambridge-Machine-Learning-Scientist-Mass/939646601

IT Early Solutions Scientific Compute Lead
UCB
Braine-l’Alleud, Belgium
https://www.linkedin.com/jobs/view/3549529533

Head of Data Science
Wild Biotech
Tel Aviv, Israel
https://wildbio.tech/careers/

Software Engineer (Machine Learning)
Isomorphic Labs
London, UK
https://www.linkedin.com/jobs/view/3377842715

Senior Research Scientist I/II, Cheminformatics (Augmented Molecular Design)
Abbvie
Chicago, IL
https://careers.abbvie.com/en/job/chicago/senior-research-scientist-i-ii-cheminformatics-augmented-molecular-design/14/45348989280

Postdocs – Geometric Deep Learning and AI/ML for Image Analysis
Johnson&Johnson
Madrid, Spain and Beerse, Belgium
https://www.linkedin.com/jobs/view/3595279242
https://www.linkedin.com/jobs/view/3595277387

Internships Computational Toxicology  
Roche
Basel, Switzerland
https://careers.roche.com/global/en/job/ROCHGLOBAL202302104188EXTERNALENGLOBAL/Student-Internship-for-%E2%80%9CeTox-Data-Integration-and-Utilization%E2%80%9D-m-f-d;-from-May-2023-for-9-12-Months
https://careers.roche.com/global/en/job/ROCHGLOBAL202302104187EXTERNALENGLOBAL/Internship-for-Students-of-Bioinformatics-Biomedicine-or%C2%A0-Computational-Sciences-with-Interest-Toxicology-in-the-Field-of-%E2%80%9CSafety-In-Silico-Prediction-SISP-Catalog-Development%E2%80%9D-m-f-d-from-May-2023-for-9-12-Months

Cheminformatics…

Spring 2023 UK-QSAR Newsletter

News from in and around the field

chemfp 4.1 is out
https://chemfp.readthedocs.io/en/latest/index.html
“chemfp is a set of command-line tools and a Python package for working with binary cheminformatics fingerprints, typically between several hundred and a few thousand bits long.”

Breeze 2.0: an interactive web-tool for visual analysis and comparison of drug response data

https://academic.oup.com/nar/advance-article/doi/10.1093/nar/gkad390/7161532

So is it a hit, or not?

Benchmarking Refined and Unrefined AlphaFold2 Structures for Hit Discovery
https://pubs.acs.org/doi/10.1021/acs.jcim.2c01219
“[…] we find that unrefined AF2 structures deliver similar enrichments to that of an apo experimentally derived structure, significantly below the enrichments using an experimentally derived holo structure. Meanwhile, the application of IFD-MD [an induced-fit protocol] can induce a binding site conformation that delivers enrichments much closer to the holo structure.”

Are Deep Learning Structural Models Sufficiently Accurate for Virtual Screening? Application of Docking Algorithms to AlphaFold2 Predicted Structures
https://pubs.acs.org/doi/10.1021/acs.jcim.2c01270
“…we strongly recommend to include some post-processing modeling to drive the binding site into a more realistic holo model” (always good if studies agree!)

Infinite Physical Monkey: Do Deep Learning Methods Really Perform Better in Conformation Generation?
https://www.biorxiv.org/content/10.1101/2023.03.08.531607v2
Sample sizes matter when evaluating docking and scoring

ChemCrow: Augmenting large-language models with chemistry tools
https://arxiv.org/abs/2304.05376
Interfacing domain knowledge with Large Language Models

“BigSolDB contains 54273 individual solubility values, 830 unique molecules and 139 individual solvents for the temperature range from 243.15 to 403.15K and at atmospheric pressure”
http://bigsoldbapp.cheminfo.space/
Thanks for making this information available – ideally pH would be added in future versions of the database

“Interpretable Machine Learning for Science with PySR and SymbolicRegression.jl”
https://www.linkedin.com/posts/milescranmer_three-years-ago-i-started-working-on-an-activity-7063610055761223681-QUW_
As the title says (but be aware of biases of your chemical datasets!)

Using Transcriptomics and Cell Morphology Data in Drug Discovery: The Long Road to Practice
https://pubs.acs.org/doi/full/10.1021/acsmedchemlett.3c00015
Might be useful to those interested in systems-based readouts and their use for drug discovery

… beyond Cheminformatics …

Catalyzing next-generation Artificial Intelligence through NeuroAI
https://pubmed.ncbi.nlm.nih.gov/36949048
The next big thing to come?

Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
https://arxiv.org/abs/2305.14342
Adam is outdated… it’s time for Sophia!

The fallacy of the average: on the ubiquity, utility and continuing novelty of Jensen’s inequality
https://journals.biologists.com/jeb/article/220/2/139/18635/The-fallacy-of-the-average-on-the-ubiquity-utility
“On average two people walk over a bridge… but actually one is flying, and one is drowning” (one of my favourite sayings)

60 Years of Artificial Intelligence at Stanford
https://www.youtube.com/watch?v=Cn6nmWlu1EA
As the title says  

OpenAI Knows GPT-4 Is Dangerous—but Won’t Do a Damn Thing About It
https://www.thedailybeast.com/openai-knows-gpt-4-is-dangerousbut-wont-do-a-damn-thing-about-it
This reads quite ‘old’ in a way already despite being just two months old, but it doesn’t mean it’s entirely outdated

Denied by AI: How Medicare Advantage plans use algorithms to cut off care for seniors in need
https://www.statnews.com/2023/03/13/medicare-advantage-plans-denial-artificial-intelligence
Data and algorithms are one thing, validation and deployment another

Humility in Design May Be Hubris in Science: Reflections on the Problem of Slodderwetenschap (Sloppy Science)
https://www.sciencedirect.com/science/article/pii/S2405872621001064
Science isn’t easy if you really think about it

Laws of Tech: Commoditize Your Complement
https://gwern.net/complement
What Microsoft did with IBM, and why (some) commercial companies are only too happy to contribute to open source projects

… and really beyond Cheminformatics

“The expert who pioneered ‘quantitative easing’ has seen enough: Central banks are too powerful and they’re to blame for inflation”
https://fortune.com/2023/03/20/is-federal-reserve-too-powerful-inflation-quantitative-easing-richard-werner
And yes, inflation we have

Language Model Validation Special Section:

GPT-4 absolutely crushes Bard in a rap battle
https://twitter.com/mehran__jalali/status/1639846978850021377
I would agree with that judgement!

And it is even learning to…
https://twitter.com/sleepinyourhat/status/1638988283018465300
… draw unicorns in TikZ 🙂

I believe this is all from my side for now – if you have any information for me to circulate, or wish to present at one of our next Cambridge Cheminformatics or Digital Science for Drug Discovery Meetings, please just let me know, cheers!

Best wishes,

Andreas

Leave a Reply

Your email address will not be published. Required fields are marked *