SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science Paper โข 2505.13220 โข Published May 19 โข 4
GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation Paper โข 2505.20416 โข Published May 26 โข 6