Logic-of-thought: Injecting logic into contexts for full reasoning in large language models

T Liu, W Xu, W Huang, X Wang, J Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
T Liu, W Xu, W Huang, X Wang, J Wang, H Yang, J Li
arXiv preprint arXiv:2409.17539, 2024arxiv.org
Large Language Models (LLMs) have demonstrated remarkable capabilities across various
tasks but their performance in complex logical reasoning tasks remains unsatisfactory.
Although some prompting methods, such as Chain-of-Thought, can improve the reasoning
ability of LLMs to some extent, they suffer from an unfaithful issue where derived conclusions
may not align with the generated reasoning chain. To address this issue, some studies
employ the approach of propositional logic to further enhance logical reasoning abilities of …
Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks but their performance in complex logical reasoning tasks remains unsatisfactory. Although some prompting methods, such as Chain-of-Thought, can improve the reasoning ability of LLMs to some extent, they suffer from an unfaithful issue where derived conclusions may not align with the generated reasoning chain. To address this issue, some studies employ the approach of propositional logic to further enhance logical reasoning abilities of LLMs. However, the potential omissions in the extraction of logical expressions in these methods can cause information loss in the logical reasoning process, thereby generating incorrect results. To this end, we propose Logic-of-Thought (LoT) prompting which employs propositional logic to generate expanded logical information from input context, and utilizes the generated logical information as an additional augmentation to the input prompts, thereby enhancing the capability of logical reasoning. The LoT is orthogonal to existing prompting methods and can be seamlessly integrated with them. Extensive experiments demonstrate that LoT boosts the performance of various prompting methods with a striking margin across five logical reasoning tasks. In particular, the LoT enhances Chain-of-Thought's performance on the ReClor dataset by +4.35%; moreover, it improves Chain-of-Thought with Self-Consistency's performance on LogiQA by +5%; additionally, it boosts performance of Tree-of-Thoughts on ProofWriter dataset by +8%.
arxiv.org