Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Shruti Patel's picture

1

Shruti Patel

shrutipatel

Yoai's profile picture

·

AI & ML interests

None yet

Organizations

shrutipatel 's collections 8

coding with llms

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 39
CodePlan: Repository-level Coding using LLMs and Planning

Paper • 2309.12499 • Published Sep 21, 2023 • 79
SCREWS: A Modular Framework for Reasoning with Revisions

Paper • 2309.13075 • Published Sep 20, 2023 • 17
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53

logical reasoning with llms

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 22
Fusion-Eval: Integrating Evaluators with LLMs

Paper • 2311.09204 • Published Nov 15, 2023 • 6
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Paper • 2311.07587 • Published Nov 8, 2023 • 5

JaxMARL: Multi-Agent RL Environments in JAX

Paper • 2311.10090 • Published Nov 16, 2023 • 8
ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 10
Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 36
Testing Language Model Agents Safely in the Wild

Paper • 2311.10538 • Published Nov 17, 2023 • 11

diffusion models

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

Paper • 2311.10093 • Published Nov 16, 2023 • 59

Large-Scale Automatic Audiobook Creation

Paper • 2309.03926 • Published Sep 7, 2023 • 55

automl with llms

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks

Paper • 2311.09835 • Published Nov 16, 2023 • 11

MultiLoRA: Democratizing LoRA for Better Multi-Task Learning

Paper • 2311.11501 • Published Nov 20, 2023 • 37

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 39

coding with llms

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 39
CodePlan: Repository-level Coding using LLMs and Planning

Paper • 2309.12499 • Published Sep 21, 2023 • 79
SCREWS: A Modular Framework for Reasoning with Revisions

Paper • 2309.13075 • Published Sep 20, 2023 • 17
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53

Large-Scale Automatic Audiobook Creation

Paper • 2309.03926 • Published Sep 7, 2023 • 55

logical reasoning with llms

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 22
Fusion-Eval: Integrating Evaluators with LLMs

Paper • 2311.09204 • Published Nov 15, 2023 • 6
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Paper • 2311.07587 • Published Nov 8, 2023 • 5

automl with llms

ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks

Paper • 2311.09835 • Published Nov 16, 2023 • 11

JaxMARL: Multi-Agent RL Environments in JAX

Paper • 2311.10090 • Published Nov 16, 2023 • 8
ToolTalk: Evaluating Tool-Usage in a Conversational Setting

Paper • 2311.10775 • Published Nov 15, 2023 • 10
Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 36
Testing Language Model Agents Safely in the Wild

Paper • 2311.10538 • Published Nov 17, 2023 • 11

MultiLoRA: Democratizing LoRA for Better Multi-Task Learning

Paper • 2311.11501 • Published Nov 20, 2023 • 37

diffusion models

The Chosen One: Consistent Characters in Text-to-Image Diffusion Models

Paper • 2311.10093 • Published Nov 16, 2023 • 59

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 39

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs