<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MyChances.net &#187; Elo</title>
	<atom:link href="http://www.mychances.net/blog/tag/elo/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mychances.net/blog</link>
	<description>Data-driven college admission prediction</description>
	<lastBuildDate>Sun, 07 Mar 2010 17:39:19 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>College Rankings #2: Pitfalls of Various Preference-Based Ranking Methods</title>
		<link>http://www.mychances.net/blog/2009/08/30/college-rankings-2-pitfalls-of-various-preference-based-ranking-methods/</link>
		<comments>http://www.mychances.net/blog/2009/08/30/college-rankings-2-pitfalls-of-various-preference-based-ranking-methods/#comments</comments>
		<pubDate>Mon, 31 Aug 2009 04:33:10 +0000</pubDate>
		<dc:creator>James</dc:creator>
				<category><![CDATA[Education]]></category>
		<category><![CDATA[MyChances.net]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[college]]></category>
		<category><![CDATA[college rankings]]></category>
		<category><![CDATA[Elo]]></category>
		<category><![CDATA[preference]]></category>
		<category><![CDATA[rankings]]></category>

		<guid isPermaLink="false">http://www.mychances.net/blog/?p=103</guid>
		<description><![CDATA[In my previous post, I introduced the new college rankings system that we have implemented. In short, the system ranks colleges based on where their admitted students decide to attend. In this post, I will discuss some of the approaches that might be considered in creating a preference-based ranking. In the next post, I will [...]]]></description>
			<content:encoded><![CDATA[<img style='float: left; margin-right: 10px; border: none;' src='http://www.gravatar.com/avatar.php?gravatar_id=a6b86431a750edb0d8748e2bf5a8290d&amp;default=' alt='No Gravatar' width=80 height=80/><p>In my <a title="New college rankings" href="http://www.mychances.net/blog/2009/07/10/new-college-rankings/" target="_self">previous post</a>, I introduced the new college rankings system that we have implemented. In short, the system ranks colleges based on where their admitted students decide to attend. In this post, I will discuss some of the approaches that might be considered in creating a preference-based ranking. In the next post, I will discuss the preference ranking system that we have implemented.</p>
<div id="attachment_102" class="wp-caption alignleft" style="width: 310px"><a rel="attachment wp-att-102" href="http://www.mychances.net/blog/2009/08/30/college-rankings-2-pitfalls-of-various-preference-based-ranking-methods/2009rankings/"><img class="size-full wp-image-102" title="2009rankings" src="http://www.mychances.net/blog/wp-content/uploads/2009/08/2009rankings.png" alt="2009 MyChances.net College Rankings" width="300" height="267" /></a><p class="wp-caption-text">2009 MyChances.net College Rankings</p></div>
<h2>Yield isn&#8217;t enough</h2>
<p>The goal of a preference-based ranking system is to capture people&#8217;s true preferences and represent them faithfully. To discover people&#8217;s preferences, a reasonable place to start might be a college&#8217;s <strong>yield</strong>. Yield is calculated as follows:</p>
<p>yield = (# of students attending) / (# of students accepted)</p>
<p>So how can we use yield to compare two schools? Suppose we match the University of Georgia (#70 on our list) against Pomona (#50 on our list). In this matchup, Georgia&#8217;s 55% yield actually beats out Pomona&#8217;s 39% yield. More of Georgia&#8217;s admitted students end up attending—so Georgia appears to be preferable to Pomona. But there is a problem here: we have no direct evidence that students, given the opportunity to attend either school, would choose Georgia. We simply don&#8217;t know what the students who were admitted to both schools would do.</p>
<p>In the abstract, there is another problem with this approach. Imagine that 100 students apply to both Georgia and Pomona. Suppose Pomona accepts 50 of them but Georgia accepts all 100. Now, suppose the 50 rejected from Pomona all decide to go to Georgia, giving it a 50% yield. Suppose, also, that 40 of Pomona&#8217;s accepted students also get into Harvard, Yale, or Princeton, and they all go off to those schools. This leaves Pomona with a 20% yield. Going by yield, it appears that Georgia is the preferred college by far—but in reality, all of the students admitted to both Pomona and Georgia who attended one of the two decide to go to Pomona. Yield, in this situation, gives us exactly the wrong answer about which school is preferred over the other!</p>
<h2>Each student matters</h2>
<p>What can we learn from the failure of yield as a measure of preference? Summary statistics simply don&#8217;t tell us enough. We need to drill down to the level of individual students. Only then can we build up a picture of their collective preferences. How can we do this? One approach might simply be to ask them what their preferences are. For example, we could survey a bunch of students applying to college, and ask them to order all of the schools they are considering, from most favorite to least favorite.</p>
<p>This is better; if those 100 people in our previous example honestly represented their preferences, we would probably see that the Pomona was preferred over Georgia. This is the intuitively correct result given our (fake) example. But even this approach isn&#8217;t perfect.</p>
<h2>Talk is cheap; opinions, cheaper</h2>
<p>One problem is that there is no cost associated with ranking a school #1 on your own personal list. Until you actually have to decide which college you are going to attend—and pay tuition to—for the next 4 years, your opinions have no teeth. Let&#8217;s say you rank UNC as your #1 school and Duke as your #8 (out of 8), because your Tar Heel family hates those Blue Devils. You apply, and get into both schools. Did I mention that you got a merit scholarship to Duke? All of a sudden, you find yourself attending your supposedly bottom-ranked school. You didn&#8217;t lie when you gave us your rankings, but you probably exaggerated how much you preferred UNC over Duke. Furthermore, you didn&#8217;t have all of the information that you used to make your decision—such as your merit scholarship—when you reported that Duke was your #8 school.</p>
<p>In general, asking people for their preferences leads to these additional problems:</p>
<ul>
<li>They may give feedback about colleges where their feedback is of questionable value. If someone with a 1.5 GPA says that they rank State U over Harvard, should that hurt Harvard—even though this person almost certainly wouldn&#8217;t be given the opportunity to attend there, anyways?</li>
<li>They almost certainly give feedback that is based on imperfect information. At the moment where people are making their decision to attend one school out of several that they were admitted to, they have acquired <em>as much information as they think they need</em> to make this huge decision. Beforehand—and, in particular, before they have applied to and been admitted to colleges—their stated preferences may be much more labile.</li>
</ul>
<p>Understanding these flaws helps flesh out a framework for a powerful-yet-simple preference-based college rankings system: one where students simply report where they were admitted and where they decided to attend. In my next post, I&#8217;ll get into some of the details of how to take this information and construct a ranked preference list. I&#8217;ll even demonstrate how this approach addresses a <a href="http://www.collegeadmissionspartners.com/college-admissions-counseling/us-news-2010-college-rankings/">common criticism</a> of the currently popular <a href="http://fromdc2iowa.blogspot.com/2009/08/choosing-college-and-rankings.html">college rankings</a>: that there is no way to truly distinguish between schools closely ranked (e.g., #3 vis-a-vis #5).</p>
<div id="_mcePaste" style="overflow: hidden; position: absolute; left: -10000px; top: 237px; width: 1px; height: 1px;">
<p>Essentially, the problem is that there is no cost associated with ranking a school #1 on your own personal list. Until you actually have to decide which college you are going to attend—and pay tuition to—for the next 4 years, your opinions have no teeth. Let&#8217;s say you rank UNC as your #1 school and Duke as your #8 (out of 8), because your Tar Heel family hates those Blue Devils. You apply, and get into both schools. Did I mention that you got a merit scholarship to Duke? All of a sudden, you find yourself attending your supposedly bottom-ranked school. You didn&#8217;t lie when you gave us your rankings, but you probably exaggerated how much you prefer UNC over Duke. Furthermore, you didn&#8217;t have all of the information that you used to make your decision—such as your merit scholarship—when you reported that Duke was your #8 school.</p></div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fwww.mychances.net%2Fblog%2F2009%2F08%2F30%2Fcollege-rankings-2-pitfalls-of-various-preference-based-ranking-methods%2F&amp;linkname=College%20Rankings%20%232%3A%20Pitfalls%20of%20Various%20Preference-Based%20Ranking%20Methods"><img src="http://www.mychances.net/blog/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://www.mychances.net/blog/2009/08/30/college-rankings-2-pitfalls-of-various-preference-based-ranking-methods/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
