view article Article LoRA training scripts of the world, unite! linoyts, multimodalart • Jan 2, 2024 • 79
view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo ariG23498, aerdem4 • Dec 23, 2024 • 51
view article Article How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs martin-gorner • Dec 5, 2024 • 14
view article Article 🔥 Argilla 2.0: the data-centric tool for AI makers 🤗 dvilasuero • Jul 30, 2024 • 39
PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking Paper • 2410.12375 • Published Oct 16, 2024 • 5
Standard-format-preference-dataset Collection We collect the open-source datasets and process them into the standard format. • 12 items • Updated Mar 2 • 26