Discussion about this post

User's avatar
Christian's avatar

Thanks for the literature overview!

As for the comparison to other methods: In my mind, instead of using a QSAR model, a transfer learning scenario would be more applicable. At least REINVENT uses iterative optimization (reinforcement learning) in contrast to pre-training (Enki) by default, so generating a mere 100 compounds will (almost) sample from the initial prior (or did you perform some sort of "warm-up" phase?) - so it is no surprise that you don't get any enrichment. Lastly, the choice of a 100 generated compounds seems extremely low: In a real-world application, when you optimize 10+ scoring components at the same time and not a single objective, it's quite likely that only a few of your 100 molecules will strike a balance that is appealing to a MedChemist

Expand full comment
4 more comments...

No posts