Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench Paper • 2310.01386 • Published Oct 2, 2023
Exploring Human-Like Translation Strategy with Large Language Models Paper • 2305.04118 • Published May 6, 2023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models Paper • 2310.20499 • Published Oct 31, 2023
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate Paper • 2305.19118 • Published May 30, 2023
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher Paper • 2308.06463 • Published Aug 12, 2023
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments Paper • 2403.11807 • Published Mar 18, 2024
Adapters for Enhanced Modeling of Multilingual Knowledge and Text Paper • 2210.13617 • Published Oct 24, 2022
Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine Paper • 2301.08745 • Published Jan 20, 2023
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training Paper • 2407.09121 • Published Jul 12, 2024
All Languages Matter: On the Multilingual Safety of Large Language Models Paper • 2310.00905 • Published Oct 2, 2023
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates Paper • 2410.20814 • Published Oct 28, 2024
CoAct: A Global-Local Hierarchy for Autonomous Agent Collaboration Paper • 2406.13381 • Published Jun 19, 2024
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought Paper • 2505.15431 • Published May 21, 2025
Revisiting the Reliability of Psychological Scales on Large Language Models Paper • 2305.19926 • Published May 31, 2023
DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper • 2510.21618 • Published Oct 24, 2025