-
CLEX: Continuous Length Extrapolation for Large Language Models
Paper • 2310.16450 • Published • 10 -
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper • 2401.06951 • Published • 26 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 25
Juan Herrera
juampahc
AI & ML interests
None yet
Recent Activity
liked a model 5 days ago
OuteAI/Llama-OuteTTS-1.0-1B upvoted a collection 10 days ago
NVIDIA Nemotron v3 liked a model about 1 month ago
xkos/Qwen3-TTS-12Hz-1.7B-ONNX