BioCoder: a benchmark for bioinformatics code generation with large language models
Tang X, Qian B, Gao R, Chen J, Chen X, Gerstein M. BioCoder: a benchmark for bioinformatics code generation with large language models. Bioinformatics 2024, 40: i266-i276. PMID: 38940140, PMCID: PMC11211839, DOI: 10.1093/bioinformatics/btae230.Peer-Reviewed Original ResearchConceptsCode generationLanguage modelAmount of domain knowledgeDomain-specific knowledgeJava methodsDomain knowledgeClass declarationsPerformance gainsData operationsPython functionsTraining datasetSuccess modelIntricate taskTest benchmarksDocker imageBenchmarksCodeSmall modelsDatasetGlobal variablesBioCodeFunctional dependenceEvaluate various modelsIncreasing needCodeGen