BioinformaticsBench: A collaboratively built large language model benchmark for Bioinformatics reasoning
Published in ICML 2024, 2024
Most of the existing Large Language Model (LLM) benchmarks on bioinformatics problem reasoning focus on problems grounded to niche research domains where datasets contain a small number of samples and, therefore are not truly representative of the broad domain of bioinformatics. To systematically examine the reasoning capabilities required for solving complex bioinformatics problems, we introduce an expansive benchmark suite BioinformaticsBench for LLMs.
Citation: Varuni Sarwal, Seungmo Lee, Rosemary He, Aingela Kattapuram, xiaoxuan wang, Eleazar Eskin, Wei Wang, Serghei Mangul. "BioinformaticsBench: A collaboratively built large language model benchmark for Bioinformatics reasoning." AccMLBio ICML 2024.
Download Paper



