## Admission blues: How to fix GRE Mathematics and tweak the Putnam Competition

I was thinking about the Putnam competition and the GRE Mathematics test in the context of graduate admissions. Are they useful? If yes, which one is more relevant? After crunching some numbers, I concluded that while they are useful to some extend, there are problems with both. Even worse, a number of students who fall in the gap between “very good” and “exceptional”, are ill served with either.

#### 1. Graduate admissions in mathematics

As I mention in my earlier post, every year the US produces around 1,600 Ph.D.’s in mathematical sciences (math, applied math, statistics) from over 100 accredited programs, of which about 900 are US citizen and permanent residents. If you restrict to mathematics alone, the numbers drop by about 25% to about 1200. The overall 10 year completion rate is about 50% according to the Council of Graduate Schools study, so perhaps about 3,000-3,200 students start graduate programs.

As a general rule, graduate programs in mathematics explicitly ask for the GRE Subject test scores, but are often happy to hear about the Putnam results as well. In fact, some “how to” guides now recommend taking Putnam exam (and Putnam prep classes!) on par with the GRE test and REU programs (see e.g. here and there). How the schools use either data is probably quite a bit different, and is the other side of our main question.

#### 2. GRE Mathematics Subject test in numbers

The GRE Subject tests are developed and administered by ETS, which is nominally non-profit, but with about 1 billion dollars in revenue. For a quick comparison with a for-profit, non-profit and public institutions, e.g. New York Times Corp, Harvard and UCLA, had 2.3, 3.7 and 4.3 bln dollars in 2011 operating revenues, respectively.

From the official GRE test preparation publication: “The questions are classified approximately as follows: *calculus* (50%), *algebra* (25%) and *other topics* (25%).” This is already unfortunate, but more on that later. Here are these “other topics”:

Introductory real analysis(sequences and series of numbers and functions, continuity, differentiability and integrability, elementary topology ofR),discrete mathematics(logic, set theory, combinatorics, graph theory, and algorithms),general topology,geometry,complex variables,probability and statistics, andnumerical analysis. The above descriptions of topics covered in the test should not be considered exhaustive […] (emphasis mine – IP)

The GRE Guide gives .92 value for the KR20 reliability test, a solid measure suggesting the test has many questions leading to different scores between strong and weak students. The students have 170 minutes for about 65 questions. The scores are on the scale from 200 to 990, are rounded to nearest multiple of 10, with standard errors 31 points, and 44 for the differences. In other words, if I understand correctly (the guide is vague on this), one should not reliably compare students with scores differing by 50 points of less. I am doubtful most grad schools follow that.

In the same GRE guide, ETS reports that there were about 12,800 test takers in four years (July 2008 to June 2011), roughly 3200 a year. This loosely coincides with our graduate student data, as the students take on average one GRE Subject test. In other words, all students with GRE scores get accepted *somewhere*. So one should not be surprised to see a high correlation (but not necessarily causation) between grad school ranking and GRE Subject scores. Curiously, ETS’s own study says GRE General are a very poor predictor of success in math graduate programs, at least when it comes to GPA and graduation rate.

So how do grad schools use the GRE Math scores? That’s very much unclear. Of course, all schools gather the statistics like averages of those applied, admitted and/or accepted (reported to the dean, external department reviewers, the NRC study, the US News, etc.), but very few make it publicly available. In a rare moment of openness, Penn State admits what amounts to *not much use of *GRE *scores*: their average scores vary widely over the years, swinging from 650 to 890, with a positive trend in recent years. In a general MO discussion on this, Pete Clark writes that University of Georgia does not require GRE Subject, so he looks for high GRE General scores. UCLA is a bit evasive: “those we offer admission to have GRE subject scores in or above the 80th percentile” which according to GRE chart amounts to minimum of about 790, suggesting relevance. MIT is blunt but imprecise: “There is no minimum GRE test score required, but if the score on the math subject GRE is not very high, evidence of excellence must be present elsewhere in the application or in the letters of recommendation.” UPenn is actually helpful: “[GRE Math score] should be at least 750, though applicants with somewhat lower scores may be admitted if the rest of their application is sufficiently strong,” and that the recent average score is 820. This all makes a very foggy picture.

#### 3. Putnam competition in numbers

The premise is simple: first Saturday in December, 6 hours (in two sittings) to solve 12 problems in all areas of mathematics, maximum of 10 points per problem. Joe Gallian wrote a nice summary. The problems are difficult: the maximal score 120 is achieved only very occasionally, once in about 10-15 years. The median score is often either 0, 1 or 2 (out of 120!), and the mean is between 5 and 10 points. I bet it must be depressing to spend 6 hours and get no or almost no points.

The top 5 scorers are “Putnam Fellows”, another 18-20 are “in the money”, and about 50-60 get “honorable mention”. In 2011, there were “4,440 students from 572 colleges and universities in Canada and the United States”. The historical data shows that there is a clear correlation between doing well on Putnam and doing well in mathematics, which is even more pronounced for the top 25, and especially Putnam fellows.

Of course, the competition is not aimed at helping graduate admissions, as emphasized by the mid-March results date (way after the applications are due and the admission decisions are made). It does not even make the scores available in any official format. In fact, historically, it is primarily a team competition, a nerdy alternative to college athletics. Finally, a competition is not necessarily similar to do research. As Kedlaya said, “A contest problem is meant to be solved in the space of minutes or hours, whereas in research, one sometimes works on the same problem for days, months, occasionally even years.”

#### 4. A bissel of analysis

**(a)** **GRE Math.** While useful to some extend, mostly for the middle and bottom scoring students, it is largely useless for most of the better prepared students. Indeed, in the “upper middle range” of 75 to 90 percentile, the test scores range between 770 and 850, comprising about 500 students every year. By the rules of GRE, many of these students cannot be even compared. Those who can, it is unclear whether they really are better candidates for doing research and teaching in mathematics. Indeed, the excessive emphasis on calculus, real analysis and linear algebra shows the student’s ability to memorize concepts and quickly perform routine tasks. This does not test problem solving. Neither do “other topics” which are heavily testing definitions of a group, ring, metric space, etc. I bet the performance in this part strongly correlates with the quality of the undergraduate institution: better colleges offer more serious math classes, and GRE Math preparation classes, which cover these basic topics; others do not.

For the top 10%, the GRE Math scores does distinguish between them, but that’s hardly necessary. Of the top 250-300 students over half of them are international and often come with accolades like “the best student in N years from the XYZ university.” Last year I recall even one European student described as “the best student since World War II from … country”. Those 100-150 that are from the US, are well served with numerous REU programs both national and at their home universities, by the Budapest and Moscow semesters, Putnam, IMO and other competitions, etc. Their GRE scores seem irrelevant in retrospect.

Now, using AMS Classification, Group I of 48 top math graduate programs graduates about 550 Ph.D’s. All are research oriented. I am guesstimating that they must be accepting c. 800 students in total. So after the top 300 are accepted, how are they suppose to choose the next 500 if GRE is irrelevant?

**(b) Putnam. **Even though a majority receive only single digit score, there is a clear benefit for the top programs to know who the winners are. The top 25 individuals, clearly possess excellent problem solving abilities, which is useful in a number of areas of mathematics. The are multiple problems with this. First, it would be nice to have the list of winners available by December. Second, it would be nice if Putnam is offered overseas. But even for the US/Canada based students, as it stands, the senior’s performance is not counted in admissions due calendar issues. Since students often are encouraged to take their junior year abroad, the best performance they can include in their applications is from their sophomore year, which is often inferior to their senior year performance. So with exception of the truly top students, Putnam results are not used in the admissions.

#### 5. A modest proposal, Russian style

**(a) GRE Math. ** Split the GRE Mathematics into two parts. Keep Calculus/Linear Algebra in the first half, more or less in the same multiple choice form as you have now. It is clearly helpful for middle and bottom tier students and programs. For the second part, make it a no-hard-math-required problem solving style. Make many relatively simple problems, much much simpler than IMO problems, more like Moscow olympiads for the freshman-sophomore HS years (8-9th year out of 11). This would allow relatively unbiased testing of problem solving, extremely useful to mathematics programs. Both scores would need to be reported (kind of like 4 GRE General scores).

As revenue figures suggest, ETS is essentially a large utility company which does not want to rock the boat. But it has made changes before, and this particular change would be relatively painless and have the added advantage that no “other fields” need to be argued about – all students will know exactly what is the scope of the test.

**(b) Putnam. **Ugh. It’s true that “if it ain’t broke, don’t fix it“, so I don’t want to propose major changes. Just three minor tweaks, which will not change the core competition, but hopefully will make it more democratic and helpful for graduate admissions.

***** First, move the competition to late September, so the scores can be revealed before Jan 1. I really don’t see what exactly is hard about that. Perhaps, some Putnam prep classes will have to be moved to the Spring. So what?

*** **Second, open it for international students. I know, I know, time difference, language issues, etc. Whatever, keep it on the US time and only in English, as it is now. If the overseas students want to participate, they might have to do this at night perhaps (simply allow unlimited tea, coffee and Red Bull). This is still better than not giving them an opportunity at all. Another issue is trust (in foreign faculty supervisors). For that, use the technology. Reveal the problem on some website for all at once. Videotape what’s happening in all rooms where the competition is taking place. Have *all* solutions uploaded as .pdf files to the main server within minutes after the end of the competition (they should still be graded locally, with top scores re-graded at a central location). While some of this might be an obstacle for some universities in poor countries, the majority of foreign universities already have all the necessary technology to make this happen.

*** **Third, and most controversially, at least for the US/Canadian students allow an easy “parallel track”. That is, come up with substantially easier problems which can be administered at the same time in parallel. The students should be given a choice – either real problems which are hard, or easier problems which do not count. This would be good for students’ morale as a means to prevent the annual 40% of 0 scores, and the scores can be useful for admission. I am modelling this based on the widely successful Tournament of Towns, which has two levels and two tracks (harder and easier), see this problem archive.

**P.S.** Full disclosure: I took GRE Math in 1994 and received maximal score available at that time. I recall finishing early, but missing a couple of problems possibly due to some English language difficulties. I did not participate in the Putnam – was busy in Moscow. More recently, I also participated in graduate admissions, but everywhere above made sure I use only open sources and no “inside information”.