LLM-Instruct - a lihaocruiser Collection

updated Oct 8, 2024

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

Paper • 2308.07074 • Published Aug 14, 2023

Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing

Paper • 2310.13855 • Published Oct 20, 2023 • 1

LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms

Paper • 2311.13133 • Published Nov 22, 2023

Group Preference Optimization: Few-Shot Alignment of Large Language Models

Paper • 2310.11523 • Published Oct 17, 2023

The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning

Paper • 2312.01552 • Published Dec 4, 2023 • 32

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Paper • 2312.09390 • Published Dec 14, 2023 • 33

The Impact of Reasoning Step Length on Large Language Models

Paper • 2401.04925 • Published Jan 10, 2024 • 18

DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines

Paper • 2310.03714 • Published Oct 5, 2023 • 37

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 26

Mixture-of-Instructions: Comprehensive Alignment of a Large Language Model through the Mixture of Diverse System Prompting Instructions

Paper • 2404.18410 • Published Apr 29, 2024 • 1

Self-Harmonized Chain of Thought

Paper • 2409.04057 • Published Sep 6, 2024 • 18

SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section

Paper • 2408.16444 • Published Aug 29, 2024 • 8

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29