Abstract

Large language models (LLMs) are transforming broad research areas, yet concerns about their trustworthiness remain. This study explored the use of Retrieval-Augmented Generation (RAG) to improve LLMs' knowledge extraction in the field of mealworm-mediated plastic degradation. We integrated publications up to June 2024 (75 papers) to evaluate the model performance using a curated data set of 100 queries. GraphRAG, LightRAG, and a traditional RAG were examined with five LLM models (GPT-4o, GPT-5, Deepseek-V3.1, Qwen-plus, and Llama-3.3). Our results reveal that LightRAG improved LLMs the most in information extraction. Specifically, for quantitative information extraction, the best performing RAG + LLM pipeline achieves over 92% accuracy. Meanwhile, for open-ended queries, LightRAG + Llama answers the questions with the best balance of precision and information coverage. Moreover, empirical results validated the answers about the mealworm gut microbiome composition and plastic deconstruction patterns through the LightRAG + Llama pipeline. In designing plastic biodegradation experiments, the original LLMs outperformed RAG-trained LLMs. The expandable nature of RAG enables timely updates to the knowledge base. This study demonstrates a reliable application of advanced LLMs in the emerging environmental science field. Our findings identify challenges, such as conflict handling, and guide future research in scientific artificial intelligence.

Affiliated Institutions

Related Publications

UniProt: the universal protein knowledgebase

The UniProt knowledgebase is a large resource of protein sequences and associated detailed annotation. The database contains over 60 million sequences, of which over half a mill...

2016 Nucleic Acids Research 4500 citations

Publication Info

Year
2025
Type
article
Citations
0
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

0
OpenAlex

Cite This

Bingdi Liu, Wenyu Li, Runyu Zhao et al. (2025). Leveraging Retrieval-Augmented Generation to Accelerate Discoveries on Mealworm Larvae and Plastic Degradation. Environmental Science & Technology . https://doi.org/10.1021/acs.est.5c14258

Identifiers

DOI
10.1021/acs.est.5c14258