Independent, complex thinking not (yet) possible after all: Study led by TU shows limitations of ChatGPT & co.
Darmstadt, August 12, 2024. According to a new study led by TU Darmstadt, AI models such as ChatGPT are apparently less capable of learning independently than previously assumed. According to the study, there is no evidence that what are known as large language models (LLMs) are beginning to develop a general "intelligent" behaviour that would enable them to proceed in a planned or intuitive manner or to think in a complex way. The study will be presented in August at the annual conference of the renowned Association for Computational Linguistics (ACL) in Bangkok, the largest international conference on automatic language processing.
The research focuses on unforeseen and sudden leaps in the performance of language models, which are referred to as "emergent abilities". After the models were introduced, scientists found that they became more powerful with increasing size and the growing amount of data with which they were trained (scaling). As the tools were scaled up, they were able to solve a larger number of language-based tasks – for example, recognising fake news or drawing logical conclusions. On the one hand, this raised hopes that further scaling would make the models even better. On the other hand, there was also concern that these abilities could become dangerous, as the LLMs could become independent and possibly escape human control. In response, AI laws were introduced worldwide, including in the European Union and the USA.
However, the authors of the current study have now come to the conclusion that there is no evidence for the presumed development of differentiated thinking in the models. Instead, the LLMs acquired the superficial skill of following relatively simple instructions, as the researchers showed. The systems are still a long way from what humans are capable of. The study was led by TU computer science professor Iryna Gurevych and her colleague Dr Harish Tayyar Madabushi from the University of Bath in the UK.
"However, our results do not mean that AI is not a threat at all," emphasised Gurevych. "Rather, we show that the purported emergence of complex thinking skills associated with specific threats is not supported by evidence and that we can control the learning process of LLMs very well after all. Future research should therefore focus on other risks posed by the models, such as their potential to be used to generate fake news."
And what do the results mean for users of AI systems such as ChatGPT? "It is probably a mistake to rely on an AI model to interpret and execute complex tasks without help," explains Gurevych, who heads the Ubiquitous Knowledge Processing (UKP) Lab at the Computer Science Department of TU Darmstadt. "Instead, users should explicitly state what the systems should do and, if possible, give examples. The important thing is: The tendency of these models to produce plausible-sounding but false results – known as confabulation – is likely to persist, even if the quality of the models has improved dramatically in recent times."
About TU Darmstadt
TU Darmstadt is one of Germany’s leading technical universities and a synonym for excellent, relevant research. We are crucially shaping global transformations – from the energy transition via Industry 4.0 to artificial intelligence – with outstanding insights and forward-looking study opportunities. TU Darmstadt pools its cutting-edge research in three fields: Energy and Environment, Information and Intelligence, Matter and Materials. Our problem-based interdisciplinarity as well as our productive interaction with society, business and politics generate progress towards sustainable development worldwide. Since we were founded in 1877, we have been one of Germany’s most international universities; as a European technical university, we are developing a trans-European campus in the network, Unite! With our partners in the alliance of Rhine-Main universities – Goethe University Frankfurt and Johannes Gutenberg University Mainz – we further the development of the metropolitan region Frankfurt-Rhine-Main as a globally attractive science location.
MI No. 36e/2024, mih
Wissenschaftlicher Ansprechpartner:
Prof Dr Iryna Gurevych
Hochschulstraße 10, S2|02 B110, 64289 Darmstadt
Tel.: +49 (0) 6151 16 - 25290; e-mail: iryna.gurevych@tu-darmstadt.de
Originalpublikation:
Sheng Lu, Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych: Are Emergent Abilities in Large Language Models just In-Context Learning?
https://arxiv.org/abs/2309.01809
https://doi.org/10.48550/arXiv.2309.01809