AI complement solves SAT geometry questions as good as normal tellurian exam taker

608 views Leave a comment

The Allen Institute for Artificial Intelligence (AI2) and University of Washington researchers have combined an synthetic comprehension (AI) complement that can solve SAT geometry questions as good as a normal American 11th-grade student, a breakthrough in AI research.

Image credit: Aaron Escobar,

Image credit: Aaron Escobar,

This system, called GeoS, uses a multiple of mechanism prophesy to conclude diagrams, healthy denunciation estimate to review and know content and a geometric solver to grasp 49 percent correctness on central SAT exam questions. If these formula were extrapolated to a whole Math SAT test, a mechanism roughly achieved an SAT magnitude of 500 (out of 800), a normal exam magnitude for 2015.

A paper surveying a research, “Solving Geometry Problems: Combining Text and Diagram Interpretation,” was a corner bid between a UW Computer Science Engineering dialect and AI2.

These results, presented during a 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP) in Lisbon, Portugal, were achieved by GeoS elucidate unaltered SAT questions that it had never seen before and that compulsory an bargain of:

* Implicit relationships

* Ambiguous references

* The relations between diagrams and natural-language text

“Unlike a Turing Test, standardised tests such as a SAT yield us currently with a proceed to magnitude a machine’s ability to reason and to review a abilities with that of a human,” said Oren Etzioni, CEO of AI2. “Much of what we know from content and graphics is not categorically stated, and requires distant some-more believe than we appreciate. Creating a complement to be means to successfully take these tests is challenging, and we are unapproachable to grasp these rare results.”

Said Ali Farhadi, comparison investigate manager for Vision during AI2 and UW partner highbrow of mechanism scholarship and engineering, “We are vehement about GeoS’s opening on real-world tasks. Our biggest plea was converting a doubt to a computer-understandable language. One needs to go over customary pattern-matching approaches for problems like elucidate geometry questions that need in-depth bargain of text, blueprint and reasoning.”

How GeoS Works

GeoS is a initial end-to-end complement that solves SAT craft geometry problems. It does this by initial interpreting a geometry doubt by regulating a blueprint and content in unison to beget a best probable judicious expressions of a problem, that it sends to a geometric solver to solve. Then it compares that answer to a multiple-choice answers for that question.

A proof of a system’s problem-solving is accessible here.

This routine is difficult by a fact that SAT questions enclose many unstated assumptions.


For example, in a SAT problem during right, there are several unstated assumptions, such as a fact that lines BD and AC join during E, that “circle O has a radius of 5” is a same as “circle O radius equals 5” and that a sketch might or might not be to scale.

GeoS had a 96 percent correctness rate on questions it was assured adequate to answer, that is an critical dimension of learning. Today, GeoS can solve craft geometry questions; AI2 is relocating to solve a full set of SAT math questions in a subsequent 3 years.

As partial of AI2’s joining to pity a investigate for a common good, all information sets and program are accessible for other researchers to use.

AI2 is also building systems that can tackle scholarship tests, that need a believe bottom that includes elements of a unstated, common-sense believe that humans beget over their lives. This Aristo plan is described here.

Co-authors embody lead author Minjoon Seo, a UW mechanism scholarship and engineering doctoral student, UW electrical engineering partner investigate highbrow Hannaneh Hajishirzi, and former UW undergraduate tyro Clint Malcolm.

About AI2

AI2 was founded in 2014 with a unaccompanied concentration of conducting high-impact investigate and engineering in a margin of synthetic intelligence, all for a common good. AI2 is a origination of Paul Allen, Microsoft cofounder, and is led by Dr. Oren Etzioni, a eminent researcher in a margin of AI. AI2 employs some-more than 35 top-notch researchers and engineers, attracting people of sundry interests and backgrounds from opposite a globe. AI2 prides itself on a farrago and partnership of this team, and takes a results-oriented proceed to formidable hurdles in AI.

Source: University of Washington