Large Language Models and the Extended Church-Turing Thesis

Jiří Wiedermann*, Jan van Leeuwen*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

The Extended Church-Turing Thesis (ECTT) posits that all effective information processing, including unbounded and non-uniform interactive computations, can be described in terms of interactive Turing machines with advice. Does this assertion also apply to the abilities of contemporary large language models (LLMs)? From a broader perspective, this question calls for an investigation of the computational power of LLMs by the classical means of computability and computational complexity theory, especially the theory of automata. Along these lines, we establish a number of fundamental results. Firstly, we argue that any fixed (non-adaptive) LLM is computationally equivalent to a, possibly very large, deterministic finite-state transducer. This characterizes the base level of LLMs. We extend this to a key result concerning the simulation of space-bounded Turing machines by LLMs. Secondly, we show that lineages of evolving LLMs are computationally equivalent to interactive Turing machines with advice. The latter finding confirms the validity of the ECTT for lineages of LLMs. From a computability viewpoint, it also suggests that lineages of LLMs possess super-Turing computational power. Consequently, in our computational model knowledge generation is in general a non-algorithmic process realized by lineages of LLMs. Finally, we discuss the merits of our findings in the broader context of several related disciplines and philosophies.

Original languageEnglish
Title of host publicationProceedings 14th International Workshop on Non-Classical Models of Automata and Applications (NCMA 2024)
EditorsFlorin Manea, Giovanni Pighizzini
Place of PublicationGöttingen
Pages198-213
Number of pages16
DOIs
Publication statusPublished - 11 Sept 2024

Publication series

NameElectronic Proceedings in Theoretical Computer Science
Volume407

Bibliographical note

Publisher Copyright:
© Jiří Wiedermann and Jan van Leeuwen.

Funding

The research of the first author was partially supported by Grant No. CK04000150 EBAVEL of the Czech Technology Agency, programme Strategy AV21 \u201CPhilosophy and Artificial Intelligence\u201D, and the Karel \u010Capek Center for Values in Science and Technology.

FundersFunder number
Czech Technology Agency
Karel Čapek Center for Values in Science and Technology

    Fingerprint

    Dive into the research topics of 'Large Language Models and the Extended Church-Turing Thesis'. Together they form a unique fingerprint.

    Cite this