Summary:
- LLMs and Reasoning Deficiency: A recent study from Apple's AI team found that large language models (LLMs) from companies like Meta and OpenAI lack basic reasoning skills.
- New Benchmark Proposal: The researchers introduced a new benchmark, called GSM-Symbolic, to evaluate the reasoning capabilities of various LLMs.
- Inconsistent Answers: Initial tests showed that minor changes in query wording led to vastly different answers, raising concerns about the models' reliability.
- Fragility in Mathematical Reasoning: The study highlighted that adding contextual information to math questions could decrease accuracy by up to 65%, indicating the models' inability to reason effectively.
- Pattern Matching Explained: The researchers concluded that LLM behavior is better described as sophisticated pattern matching, which is sensitive to changes in input, affecting output consistency.
Read more at: Apple Insider | arXiv paper