oceansweep's Collections Relevant-Papers-Midterm
Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models
Paper
• 2402.14848
• Published
• 19
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper
• 2406.06608
• Published
• 68
CRAG -- Comprehensive RAG Benchmark
Paper
• 2406.04744
• Published
• 46
Transformers meet Neural Algorithmic Reasoners
Paper
• 2406.09308
• Published
• 44
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Paper
• 2406.09403
• Published
• 23
Interpreting the Weight Space of Customized Diffusion Models
Paper
• 2406.09413
• Published
• 20
OpenVLA: An Open-Source Vision-Language-Action Model
Paper
• 2406.09246
• Published
• 43
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models
Paper
• 2406.09416
• Published
• 29
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
Paper
• 2406.09415
• Published
• 51
Paper
• 2406.09414
• Published
• 103
Large Language Model Confidence Estimation via Black-Box Access
Paper
• 2406.04370
• Published
• 22
DataComp-LM: In search of the next generation of training sets for language models
Paper
• 2406.11794
• Published
• 55
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper
• 2311.06242
• Published
• 95
Breaking the Attention Bottleneck
Paper
• 2406.10906
• Published
• 4
Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models
Paper
• 2406.11230
• Published
• 33
google/xtr-base-multilingual
0.3B • Updated
• 19
• 9
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
Paper
• 2406.15319
• Published
• 64
Agentless: Demystifying LLM-based Software Engineering Agents
Paper
• 2407.01489
• Published
• 65
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper
• 2407.01370
• Published
• 89
LiteSearch: Efficacious Tree Search for LLM
Paper
• 2407.00320
• Published
• 40
Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
Paper
• 2407.07071
• Published
• 12
AgentInstruct: Toward Generative Teaching with Agentic Flows
Paper
• 2407.03502
• Published
• 51
LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference
Paper
• 2407.14057
• Published
• 46
BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack
Paper
• 2406.10149
• Published
• 52
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper
• 2408.08152
• Published
• 61
Why Does the Effective Context Length of LLMs Fall Short?
Paper
• 2410.18745
• Published
• 17
arcee-ai/SuperNova-Medius-GGUF
15B • Updated
• 1.21k
• 63