netzkontrast's Collections
Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback
Paper
•
2501.03916
•
Published
•
16
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought
Paper
•
2501.04682
•
Published
•
99
Agent Laboratory: Using LLM Agents as Research Assistants
Paper
•
2501.04227
•
Published
•
95
Search-o1: Agentic Search-Enhanced Large Reasoning Models
Paper
•
2501.05366
•
Published
•
102
Entropy-Guided Attention for Private LLMs
Paper
•
2501.03489
•
Published
•
14
Enabling Scalable Oversight via Self-Evolving Critic
Paper
•
2501.05727
•
Published
•
72
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains
Paper
•
2501.05707
•
Published
•
20
SPAM: Spike-Aware Adam with Momentum Reset for Stable LLM Training
Paper
•
2501.06842
•
Published
•
16
Diving into Self-Evolving Training for Multimodal Reasoning
Paper
•
2412.17451
•
Published
•
42
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
Paper
•
2412.17739
•
Published
•
41
ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing
Paper
•
2412.14711
•
Published
•
16
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper
•
2412.21187
•
Published
•
40
ProgCo: Program Helps Self-Correction of Large Language Models
Paper
•
2501.01264
•
Published
•
26
Tensor Product Attention Is All You Need
Paper
•
2501.06425
•
Published
•
90
Transformer^2: Self-adaptive LLMs
Paper
•
2501.06252
•
Published
•
54
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper
•
2501.08313
•
Published
•
300
Evolving Deeper LLM Thinking
Paper
•
2501.09891
•
Published
•
115
Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models
Paper
•
2510.04618
•
Published
•
127
Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management
Paper
•
2508.04664
•
Published
•
13
AgentFold: Long-Horizon Web Agents with Proactive Context Management
Paper
•
2510.24699
•
Published
•
69
Memory as Action: Autonomous Context Curation for Long-Horizon Agentic Tasks
Paper
•
2510.12635
•
Published
•
16