Jump to content

Proof assistant

(Redirected from Automated theorem checking)
An interactive proof session in CoqIDE, showing the proof script on the left and the proof state on the right

In computer science and mathematical logic, a proof assistant or interactive theorem prover is a software tool to assist with the development of formal proofs by human–machine collaboration. This involves some sort of interactive proof editor, or other interface, with which a human can guide the search for proofs, the details of which are stored in, and some steps provided by, a computer.

A recent effort within this field is making these tools use artificial intelligence to automate the formalization of ordinary mathematics.[1]

System comparison

[edit]
NameLatest versionDeveloper(s)Implementation languageFeatures
Higher-order logicDependent typesSmall kernelProof automationProof by reflectionCode generation
ACL28.3Matt Kaufmann and J Strother MooreCommon LispNoUntypedNoYesYes[2]Already executable
Agda2.6.4.3[3]Ulf Norell, Nils Anders Danielsson, and Andreas Abel (Chalmers and Gothenburg)[3]Haskell[3]Yes[citation needed]Yes[4]Yes[citation needed]No[citation needed]Partial[citation needed]Already executable[citation needed]
Albatross0.4Helmut BrandlOCamlYesNoYesYesUn­knownNot yet Implemented
Coq8.19.0INRIAOCamlYesYesYesYesYesYes
F*repositoryMicrosoft Research and INRIAF*YesYesNoYesYes[5]Yes
HOL LightrepositoryJohn HarrisonOCamlYesNoYesYesNoNo
HOL4Kananaskis-13 (or repo)Michael Norrish, Konrad Slind, and othersStandard MLYesNoYesYesNoYes
Idris2 0.6.0.Edwin BradyIdrisYesYesYesUn­knownPartialYes
IsabelleIsabelle2024 (May 2024)Larry Paulson (Cambridge), Tobias Nipkow (München) and Makarius WenzelStandard ML, ScalaYesNoYesYesYesYes
Leanv4.7.0[6]Leonardo de Moura (Microsoft Research)C++, LeanYesYesYesYesYesYes
LEGO (not affiliated with Lego)1.3.1Randy Pollack (Edinburgh)Standard MLYesYesYesNoNoNo
Metamathv0.198[7]Norman MegillANSI C
Mizar8.1.11Białystok UniversityFree PascalPartialYesNoNoNoNo
Nqthm
NuPRL5Cornell UniversityCommon LispYesYesYesYesUn­knownYes
PVS6.0SRI InternationalCommon LispYesYesNoYesNoUn­known
Twelf1.7.1Frank Pfenning and Carsten SchürmannStandard MLYesYesUn­knownNoNoUn­known
  • ACL2 – a programming language, a first-order logical theory, and a theorem prover (with both interactive and automatic modes) in the Boyer–Moore tradition.
  • Coq – Allows the expression of mathematical assertions, mechanically checks proofs of these assertions, helps to find formal proofs, and extracts a certified program from the constructive proof of its formal specification.
  • HOL theorem provers – A family of tools ultimately derived from the LCF theorem prover. In these systems the logical core is a library of their programming language. Theorems represent new elements of the language and can only be introduced via "strategies" which guarantee logical correctness. Strategy composition gives users the ability to produce significant proofs with relatively few interactions with the system. Members of the family include:
  • IMPS, An Interactive Mathematical Proof System.[8]
  • Isabelle is an interactive theorem prover, successor of HOL. The main code-base is BSD-licensed, but the Isabelle distribution bundles many add-on tools with different licenses.
  • Jape – Java based.
  • Lean
  • LEGO
  • Matita – A light system based on the Calculus of Inductive Constructions.
  • MINLOG – A proof assistant based on first-order minimal logic.
  • Mizar – A proof assistant based on first-order logic, in a natural deduction style, and Tarski–Grothendieck set theory.
  • PhoX – A proof assistant based on higher-order logic which is eXtensible.
  • Prototype Verification System (PVS) – a proof language and system based on higher-order logic.
  • TPS and ETPS – Interactive theorem provers also based on simply-typed lambda calculus, but based on an independent formulation of the logical theory and independent implementation.

User interfaces

[edit]

A popular front-end for proof assistants is the Emacs-based Proof General, developed at the University of Edinburgh.

Coq includes CoqIDE, which is based on OCaml/Gtk. Isabelle includes Isabelle/jEdit, which is based on jEdit and the Isabelle/Scala infrastructure for document-oriented proof processing. More recently, Visual Studio Code extensions have been developed for Coq,[9] Isabelle by Makarius Wenzel,[10] and for Lean 4 by the leanprover developers.[11]

Formalization extent

[edit]

Freek Wiedijk has been keeping a ranking of proof assistants by the amount of formalized theorems out of a list of 100 well-known theorems. As of September 2023, only five systems have formalized proofs of more than 70% of the theorems, namely Isabelle, HOL Light, Coq, Lean, and Metamath.[12][13]

Notable formalized proofs

[edit]

The following is a list of notable proofs that have been formalized within proof assistants.

TheoremProof assistantYear
Four color theorem[14]Coq2005
Feit–Thompson theorem[15]Coq2012
Fundamental group of the circle[16]Coq2013
Erdős–Graham problem[17][18]Lean2022
Polynomial Freiman-Ruzsa conjecture over [19]Lean2023
BB(5) = 47,176,870[20]Coq2024

See also

[edit]

Notes

[edit]
  1. ^ Ornes, Stephen (August 27, 2020). "Quanta Magazine – How Close Are Computers to Automating Mathematical Reasoning?".
  2. ^ Hunt, Warren; Matt Kaufmann; Robert Bellarmine Krug; J Moore; Eric W. Smith (2005). "Meta Reasoning in ACL2" (PDF). Theorem Proving in Higher Order Logics. Lecture Notes in Computer Science. Vol. 3603. pp. 163–178. doi:10.1007/11541868_11. ISBN 978-3-540-28372-0.
  3. ^ Jump up to: a b c "agda/agda: Agda is a dependently typed programming language / interactive theorem prover". GitHub. Retrieved 31 July 2024.
  4. ^ "The Agda Wiki". Retrieved 31 July 2024.
  5. ^ Search for "proofs by reflection": arXiv:1803.06547
  6. ^ "Lean 4 Releases Page". GitHub. Retrieved 15 October 2023.
  7. ^ "Release v0.198 · metamath/Metamath-exe". GitHub.
  8. ^ Farmer, William M.; Guttman, Joshua D.; Thayer, F. Javier (1993). "IMPS: An interactive mathematical proof system". Journal of Automated Reasoning. 11 (2): 213–248. doi:10.1007/BF00881906. S2CID 3084322. Retrieved 22 January 2020.
  9. ^ "coq-community/vscoq". July 29, 2024 – via GitHub.
  10. ^ Wenzel, Makarius. "Isabelle". Retrieved 2 November 2019.
  11. ^ "VS Code Lean 4". GitHub. Retrieved 15 October 2023.
  12. ^ Wiedijk, Freek (15 September 2023). "Formalizing 100 Theorems".
  13. ^ Geuvers, Herman (February 2009). "Proof assistants: History, ideas and future". Sādhanā. 34 (1): 3–25. doi:10.1007/s12046-009-0001-5. hdl:2066/75958. S2CID 14827467.
  14. ^ Gonthier, Georges (2008), "Formal Proof—The Four-Color Theorem" (PDF), Notices of the American Mathematical Society, 55 (11): 1382–1393, MR 2463991, archived (PDF) from the original on 2011-08-05
  15. ^ "Feit thomson proved in coq - Microsoft Research Inria Joint Centre". 2016-11-19. Archived from the original on 2016-11-19. Retrieved 2023-12-07.
  16. ^ Licata, Daniel R.; Shulman, Michael (2013). "Calculating the Fundamental Group of the Circle in Homotopy Type Theory". 2013 28th Annual ACM/IEEE Symposium on Logic in Computer Science. pp. 223–232. arXiv:1301.3443. doi:10.1109/lics.2013.28. ISBN 978-1-4799-0413-6. S2CID 5661377. Retrieved 2023-12-07.
  17. ^ "Math Problem 3,500 Years In The Making Finally Gets A Solution". IFLScience. 2022-03-11. Retrieved 2024-02-09.
  18. ^ Avigad, Jeremy (2023). "Mathematics and the formal turn". arXiv:2311.00007 [math.HO].
  19. ^ Sloman, Leila (2023-12-06). "'A-Team' of Math Proves a Critical Link Between Addition and Sets". Quanta Magazine. Retrieved 2023-12-07.
  20. ^ "We have proved "BB(5) = 47,176,870"". The Busy Beaver Challenge. 2024-07-02. Retrieved 2024-07-09.

References

[edit]
[edit]
Catalogues
Arc.Ask3.Ru: конец переведенного документа.
Arc.Ask3.Ru
Номер скриншота №: a5d500b14e5e276b94beb871ca96ed22__1722573540
URL1:https://arc.ask3.ru/arc/aa/a5/22/a5d500b14e5e276b94beb871ca96ed22.html
Заголовок, (Title) документа по адресу, URL1:
Proof assistant - Wikipedia
Данный printscreen веб страницы (снимок веб страницы, скриншот веб страницы), визуально-программная копия документа расположенного по адресу URL1 и сохраненная в файл, имеет: квалифицированную, усовершенствованную (подтверждены: метки времени, валидность сертификата), открепленную ЭЦП (приложена к данному файлу), что может быть использовано для подтверждения содержания и факта существования документа в этот момент времени. Права на данный скриншот принадлежат администрации Ask3.ru, использование в качестве доказательства только с письменного разрешения правообладателя скриншота. Администрация Ask3.ru не несет ответственности за информацию размещенную на данном скриншоте. Права на прочие зарегистрированные элементы любого права, изображенные на снимках принадлежат их владельцам. Качество перевода предоставляется как есть. Любые претензии, иски не могут быть предъявлены. Если вы не согласны с любым пунктом перечисленным выше, вы не можете использовать данный сайт и информация размещенную на нем (сайте/странице), немедленно покиньте данный сайт. В случае нарушения любого пункта перечисленного выше, штраф 55! (Пятьдесят пять факториал, Денежную единицу (имеющую самостоятельную стоимость) можете выбрать самостоятельно, выплаичвается товарами в течение 7 дней с момента нарушения.)