<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Emerging Computational Linguistics</title>
	<atom:link href="http://www.emergingcl.com/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://www.emergingcl.com</link>
	<description>A journey to understand how Language works</description>
	<lastBuildDate>Tue, 23 Feb 2010 00:24:15 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Power of the Human Mind</title>
		<link>http://www.emergingcl.com/?p=138</link>
		<comments>http://www.emergingcl.com/?p=138#comments</comments>
		<pubDate>Tue, 23 Feb 2010 00:24:15 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Thought Provokers]]></category>

		<guid isPermaLink="false">http://www.emergingcl.com/?p=138</guid>
		<description><![CDATA[I received this text in an email.  It pretty much says it all.
i cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdanieg. The phaonmneal pweor of the hmuan mnid, aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it dseno&#8217;t mtaetr in waht oerdr the ltteres in a wrod are, the olny iproamtnt tihng is [...]]]></description>
			<content:encoded><![CDATA[<p>I received this text in an email.  It pretty much says it all.</p>
<blockquote><p>i cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdanieg. The phaonmneal pweor of the hmuan mnid, aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it dseno&#8217;t mtaetr in waht oerdr the ltteres in a wrod are, the olny iproamtnt tihng is taht the frsit and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it whotuit a pboerlm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe. Azanmig huh? yaeh and I awlyas tghuhot slpeling was ipmorantt!</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=138</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More Ideas from Yoav Seginer</title>
		<link>http://www.emergingcl.com/?p=133</link>
		<comments>http://www.emergingcl.com/?p=133#comments</comments>
		<pubDate>Sun, 07 Feb 2010 03:48:40 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Thought Provokers]]></category>

		<guid isPermaLink="false">http://www.emergingcl.com/?p=133</guid>
		<description><![CDATA[Yoav Seginer wrote his dissertation on the Incremental Parser.  The paper is pretty easy to read – accessible.  The introduction is an especially well written introduction to unsupervised grammar induction.
I was surprised to read what he had to say about substitutability.   Substitutability is the capacity to replace phrases with other phrases that are of the [...]]]></description>
			<content:encoded><![CDATA[<p>Yoav Seginer wrote his dissertation on the Incremental Parser.  The paper is pretty easy to read – accessible.  The introduction is an especially well written introduction to unsupervised grammar induction.</p>
<p>I was surprised to read what he had to say about substitutability.   Substitutability is the capacity to replace phrases with other phrases that are of the same type.  For example ‘the dog’ in ‘the dog ran to town’ can be replaced with ‘it’.   So in some sense, the phrase ‘the dog’ and ‘it’ can be substituted for each other.  This is one of the cornerstones of linguistic theory and is used as a basis of many parsing techniques.  The PCFG parser uses probabilities for a phrase type that can be traded out in a given context.</p>
<p>However, Seginer makes the claim that substitutability is not required for his incremental parser.</p>
<p style="padding-left: 30px;">Substitutability, the essential idea of the Harris method, which has been seen as a starting point for the induction process for so long, turns out to be unnecessary in unsupervised parsing.</p>
<p style="padding-left: 30px;"> …unlabeled parsing which only requires the parser to identify the constituents (or dependency links) but does not require them to be labeled, is purely syntagmatic (by definition).   A parser induction algorithm can therefore focus on learning to detect syntactic units while ignoring substitutability. (p20)</p>
<p>Another fresh idea (to me) from Seginer’s paper is the skewness of language structure.</p>
<p style="padding-left: 30px;">The syntactic structure of natural language is skewed. This simply means that when the syntactic structure of an utterance is represented by a tree, each node in the tree has at least one short branch. The shorter the shortest branch is, the greater the skewness.  (p22)</p>
<p>Essentially how the incremental parser takes advantage of skewness is to expect skewness in the parse result.  This reduces the search size and thereby make the parsing process more efficient.</p>
<p style="padding-left: 30px;">…context free grammars, allow (a-priori) any tree structure and, therefore, a learning algorithm for such representations must discover by itself the skewness property of syntactic trees. However, if this property is indeed universal, there is no need to burden the learning algorithm with its discovery and it is possible to code skewness directly into the parser.  (p23)</p>
<p>He claims that ‘coding the skewness’ into the syntactic representation and the parser, i.e., expecting branches to be of mixed depths, does not retract from the accuracy of the parse result.</p>
<p>Here is a link to Yoav’s dissertation.  <a href="http://dare.uva.nl/document/52554">Dissertation</a></p>
<p>Since graduating in 2007, it looks like Yoav Seginer is working at a  small company in Amsterdam, Mondria Technologies Ltd (according to LinkedIn.com).  The company website doesn’t say anything yet.  I wonder if they are working on a project that uses the incremental parser.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=133</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Incremental Unsupervised Grammar Learning</title>
		<link>http://www.emergingcl.com/?p=129</link>
		<comments>http://www.emergingcl.com/?p=129#comments</comments>
		<pubDate>Fri, 05 Feb 2010 18:18:09 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Thought Provokers]]></category>

		<guid isPermaLink="false">http://www.emergingcl.com/?p=129</guid>
		<description><![CDATA[This paper by Yoav Seginer is very exciting.  It covers a method of learning a language grammar that resonates with my mental model of how a young child learns language.  These are some aspects of the Seginer algorithm.
Incremental Learning.  The system adds information to the grammar with each new sentence.  In other approaches (CCM by [...]]]></description>
			<content:encoded><![CDATA[<p>This paper by Yoav Seginer is very exciting.  It covers a method of learning a language grammar that resonates with my mental model of how a young child learns language.  These are some aspects of the Seginer algorithm.</p>
<p><strong>Incremental Learning.</strong>  The system adds information to the grammar with each new sentence.  In other approaches (CCM by Manning and Klein, UDOP by Bod), the entire corpus is processed as a block to collect the parameters of the grammar and the entire corpus is repeatedly processed until the learning converges on final result.  In Seginer’s approach, the corpus is processed one sentence at a time.  After each sentence, the grammar weights are updated.  That sentence is not revisited for further training.</p>
<p><strong>Simpler Math. </strong> The approach by Seginer uses much simpler math for computing the parameters of the resulting grammar.   There are a few ratios, some accumulation of values into other, and some comparison of weights to choose which one to apply for a given step.  There are no long chains of probabilities to compute the best parse.  I don’t have any principled reason why this makes more sense for a model of how the brain works – it just fits my gut feeling better.</p>
<p><strong>Not Restricted to Binary Trees.</strong>  The CMM approach and others approaches that are referenced in Seginer’s paper all give binary parse trees as the result.  But natural languages aren’t limited to binary trees.  Although it is possible to represent any non-binary tree as a binary tree, forcing a binary representation onto natural language is not adding to the value of the result.  Seginer’s process gives parse trees that include constructions of more than two nodes.</p>
<p><strong>Exocentric Constructions.</strong>   A phrase like ‘the boy’ has links going both directions between the two words.  Either word can be considered the head of the phrase – which in some way matches the disagreement between linguists about which word is actually the head of the phrase.</p>
<p><strong>Link depths.</strong>  His algorithm result include a depth value on dependency links.  This value can be used to distinguish between internal and external arguments to a phrase.</p>
<p><strong>Incremental Parsing.</strong>  For each word read into the sentence, links can only be added to or from the new word.</p>
<p>             I know the boy (sleeps)</p>
<p>When the algorithm encounters the word <em>sleeps</em> it can only add a link to or from this word.  No links that were generated at earlier steps can be affected.   This reduces the search space and contributes to the overall speed of the parser.  Another advantage is that the links in ‘I know the boy’ are an exact subset of the links in ‘I know the boy sleeps’.  There is no decision necessary about dropping links that were previously discovered.</p>
<p><strong>No Clustering.</strong>  Seginer’s approach groups words with class labels that in some respect are part of speech classes.  However, the approach does not require clustering of words into specific classes.  It finds similarities between <em>the</em> and <em>a</em> but it does not require that all determiners are grouped together in a well defined class.  This makes the approach better able to deal with noise in the training data.</p>
<p><strong>Homophony.</strong>  When a word has more than one meaning, it can confuse any machine algorithm.  Seginer’s algorithm deals with homophony by comparing labels between target pairs of words.  In the example that follows, words in brackets are possible labels for the target words.  When an underscore appears in the label, it means look at  the label of the preceding or following word.</p>
<p>             This[the] year[_the]    (this is a determiner)</p>
<p>            This[is_] was [is]          (this is a pronoun).</p>
<p>In the example given above, in one case <em>this</em> is a determiner.  One of its labels is [the] because <em>this</em> and <em>the</em> occur in similar places.  In the second line a label for <em>this</em> is [is_] which means that <em>this</em> is frequently followed by <em>is</em>.  The algorithm finds pairs of labels such as [the] and [_the] or [is_] and [is] to decide where to put links.</p>
<p>I think Seginer’s approach has some real merit.  It is a greedy algorithm – it learns as it goes and does not need to revisit previous sentences.  It uses simple math.  It has results that seem to match psycholinguistic models.   It would be great to see this approach extended to take advantage of other linguistic phenomena such as morphology.</p>
<p> A link to the paper can be found here.  <a href="http://www.aclweb.org/anthology/P07-1049">Paper</a></p>
<p> A link to a video presentation on the approach can be found here.    <a href="http://videolectures.net/mlcs07_seginer_ili/">Video</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=129</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Important Language Characteristic</title>
		<link>http://www.emergingcl.com/?p=124</link>
		<comments>http://www.emergingcl.com/?p=124#comments</comments>
		<pubDate>Mon, 11 Jan 2010 05:15:37 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Thought Provokers]]></category>

		<guid isPermaLink="false">http://www.emergingcl.com/?p=124</guid>
		<description><![CDATA[Today I was pondering the issue of head directionality of languages &#8211; does the head of a phrase come before or after the remaining portion of the phrase.  This parameter has been largely disregarded because most languages are inconsistent its use.
However, it occurred to me that infants could rely on the heavy weighting of head [...]]]></description>
			<content:encoded><![CDATA[<p>Today I was pondering the issue of head directionality of languages &#8211; does the head of a phrase come before or after the remaining portion of the phrase.  This parameter has been largely disregarded because most languages are inconsistent its use.</p>
<p>However, it occurred to me that infants could rely on the heavy weighting of head directionality when they are first learning a language.  By choosing the direction that is predominant and ignoring the incoming utterances of the opposite direction, it would simplify the initial learning phase.</p>
<p>Perhaps someone has already proposed this little insight into child language acquisition.</p>
<p>As I was pondering this, it also occurred to me that the single most important characteristic of a language is that it can be learned by an infant.  If the majority of infants in a culture can&#8217;t learn the language of their parents, then that language will not persist.  It will die out.</p>
<p>Perhaps someone else has already proposed this little insight as well.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=124</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Videos on Unsupervised Learning of Syntax</title>
		<link>http://www.emergingcl.com/?p=102</link>
		<comments>http://www.emergingcl.com/?p=102#comments</comments>
		<pubDate>Fri, 01 Jan 2010 18:15:30 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Thought Provokers]]></category>

		<guid isPermaLink="false">http://www.emergingcl.com/?p=102</guid>
		<description><![CDATA[Here are two links to videos on Unsupervised Learning of natural language syntax.
Chris Manning gave this talk at MLCS 2007.  It is a fairly detailed discussion of the work that Chris did with Dan Klein on learning syntax structure.  Dan used the material for his dissertation.
Chris Manning Video
Dan Klein gave this talk at UAI 2008.  [...]]]></description>
			<content:encoded><![CDATA[<p>Here are two links to videos on Unsupervised Learning of natural language syntax.</p>
<p>Chris Manning gave this talk at MLCS 2007.  It is a fairly detailed discussion of the work that Chris did with Dan Klein on learning syntax structure.  Dan used the material for his dissertation.</p>
<p><a title="Chris Manning Video" href="http://videolectures.net/mlcs07_manning_uls/" target="_blank">Chris Manning Video</a></p>
<p>Dan Klein gave this talk at UAI 2008.  It is more high level, and describes how Dan and his students have applied their work to a few other areas of unsupervised learning<strong>.</strong></p>
<p><a title="Dan Klein Video" href="http://videolectures.net/uai08_klein_ul/" target="_blank">Dan Klein Video</a></p>
<p>Both these videos are worth watching.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=102</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning is a life long adventure</title>
		<link>http://www.emergingcl.com/?p=86</link>
		<comments>http://www.emergingcl.com/?p=86#comments</comments>
		<pubDate>Thu, 03 Sep 2009 03:12:04 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://69.89.21.80/~emergin2/?p=86</guid>
		<description><![CDATA[I finished my MA degree at the University of Washington.  Many thanks to the faculty and staff that helped me finish.  Many thanks to my wife and family that supported me while I struggled through it.
However, this week I was reminded that there are limits to how much you can learn in a degree program [...]]]></description>
			<content:encoded><![CDATA[<p>I finished my MA degree at the University of Washington.  Many thanks to the faculty and staff that helped me finish.  Many thanks to my wife and family that supported me while I struggled through it.</p>
<p>However, this week I was reminded that there are limits to how much you can learn in a degree program that lasts only 5 quarters.   A blog at LingPipe (<a href="http://lingpipe-blog.com/2009/06/08/comparing-bernoulli-and-multinomial-reply/" title="a review of my blog">a review of my blog</a>) (<a href="http://lingpipe-blog.com" target="_blank">lingpipe-blog.com</a>) pointed out several errors in what I had written about here in one of my posts.  Well, lucky for me I hadn&#8217;t intended the text to be submitted to a conference jury for review.  The paper was just a one-week assignment.  As I remember, I passed the class, so the paper probably met the requirements for the course even though it has very clear short comings in the eye of the LingPipe reviewer.</p>
<p>Maybe I will find the time to dig out the code that was used for that paper, and compare it against the evaluation at LingPipe and see if I can understand what the LingPipe objections were.  I might even try to re-implement the code following the expert&#8217;s directions.</p>
<p>Another way I look at this is: the more you know, the more you know that you don&#8217;t know.  It has been healthy for me to receive some constructive criticism of my post.  I have been writing software for only 30 years, so maybe in the next 30 years, I can learn how to fix the problems that LingPipe pointed out.  For now, I am just an average software engineer.  Code reviews are part of the process.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=86</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Stuff of Thought</title>
		<link>http://www.emergingcl.com/?p=84</link>
		<comments>http://www.emergingcl.com/?p=84#comments</comments>
		<pubDate>Sat, 06 Jun 2009 04:46:55 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Thought Provokers]]></category>

		<guid isPermaLink="false">http://69.89.21.80/~emergin2/?p=84</guid>
		<description><![CDATA[I have been reading The Stuff of Thought by Steven Pinker which is a survey of different ways language gives clues to how the mind works.  (See my related entry dated September 28, 2007)  In the fourth chapter, he gives a very enlightening portrayal of how language describes objects in space. 
&#8220;Languages tend to have terms [...]]]></description>
			<content:encoded><![CDATA[<p>I have been reading <em>The Stuff of Thought</em> by Steven Pinker which is a survey of different ways language gives clues to how the mind works.  (See my related entry dated September 28, 2007)  In the fourth chapter, he gives a very enlightening portrayal of how language describes objects in space. </p>
<p>&#8220;Languages tend to have terms for contact, vertical alignment, attachment, containment, and proximity, as if there were a cognitive alphabet of spatial relationships more basic than the prepositions of a given language.&#8221;  (p. 178)</p>
<p>&#8220;A light bulb is considered to be <em>in</em> a socket when its base is inserted, since that allows it to be illuminated, but a person is not <em>in</em> a car if only his arm extends in through a window, since that doesn&#8217;t allow the car to move him or even shelter him.&#8221; (p. 187)  The meaning of the preposition <em>in</em> depends on the objects that are being described.</p>
<p>&#8220;If Sally has one big stone and Jenny three much smaller stones, who has more?  The question by itself is unanswerable: it depends on whether you mean &#8220;more stone&#8221;, or &#8220;more stones.&#8221;  (p. 173)   The meaning of <em>more</em> depends on whether it is referring to the number of objects (stones) or the mass/volume/weight of the object (stone).</p>
<p>&#8220;The part of the mind that interfaces with language treats objects schematically.  &#8230; Every morsel of matter has a length, a width, and a thickness, but when we speak of these morsels we pretend that some of the dimensions aren&#8217;t there. &#8230; A road, a river, or a ribbon is conceptualized as an unbounded line (its length which serves as its single primary dimension) fattened out by a bounded line (its width which serves as a secondary dimension), resulting in a surface.&#8221;  (pp. 179, 180)</p>
<p> &#8221;Since words and syllables aren&#8217;t free, languages economize when they can. &#8230; Imagine you are in a rainstorm, ten feet away from an overhanging ledge.  Move one foot toward it, you still get wet.  Move over another foot; you still get wet.  Keep moving, and at some point you no longer get wet.  Continue to move another foot in the same direction, you don&#8217;t get any dryer.  So nature has set up a discontinuity between the segment of the path where gradual changes of position leave you equally wet and the segment where gradual changes leave you equally dry.  And it is exactly at that discontinuity that one would begin to describe your position using <em>under</em> rather than <em>near</em>.&#8221;  (P 186)</p>
<p>&#8220;Spatial terms quantize space at the cusps where causal events play out differently on each side.  As your palm gradually [wraps] around a marble, the curvature at which you stop saying the marble is <em>on</em> the hand and start saying it&#8217;s <em>in</em> is more or less the shape that would prevent it from rolling off when you jiggle it.&#8221;  (P 186)</p>
<p>Dr. Pinker&#8217;s premise in this book is that language reflects our thoughts.  By disecting our language, we get a glimpse of how the thought engine behind the language works.  We use count nouns and mass nouns in language because our minds see countable items such as <em>chairs</em> or <em>dogs</em> and our minds also see non-countable mass objects such as <em>water</em> or <em>furniture</em>.  We use a preposition like <em>along</em> to describe proximity to a one-dimensional line and we use <em>inside</em> to describe containment in a two or three dimensional object. </p>
<p>All languages take slightly different approaches to describing space, but there are similarities that can possibly be used to infer an underlying brain structure that helps define our language.  &#8220;Most of the world&#8217;s languages divide the space around the speaker into just two regions, though about a quarter of them (including Spanish) make a three-way distinction among &#8216;near me&#8217;, &#8216;far from me&#8217;, and &#8216;in between,&#8217; and a very few go to four, adding &#8216;very far from me&#8217;.&#8221; (p. 178)  He is referring to the English terms <em>here</em> (near me) and <em>there </em>(far from me).</p>
<p>&#8220;Not all languages carve [spatial relationships] up in the same way.  Presumably this is because each language trades off expressiveness, precision, word length, and vocabulary size in a different way.  But the quantization of spatial relations is universal, and causally important relations like contact, attachment, alignment, verticality and proximity make their appearance in all the spatial vocabularies of the world.&#8221;  (p. 187)</p>
<p>The book is an excellent example of Dr. Pinker&#8217;s writing &#8211; it is entertaining while at the same time being specific and to the point.  He digs into the issues and comes at them from all aspects &#8211; cognitive psychology, neuro-science, pathology, and child language acquisition.  He is an academic, and at the same time he presents his material in a way that is concise and engaging.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=84</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open Domain Question Answering</title>
		<link>http://www.emergingcl.com/?p=82</link>
		<comments>http://www.emergingcl.com/?p=82#comments</comments>
		<pubDate>Tue, 20 May 2008 00:00:57 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Local Development]]></category>

		<guid isPermaLink="false">http://69.89.21.80/~emergin2/?p=82</guid>
		<description><![CDATA[As part of the CLMA program here at University of Washington, we were asked to investigate an area of interest and write a short summary of what we read about.
So I chose to look at the TREC Question Answering challenge and read about some of the techniques that were used for submissions to that evaluation.  [...]]]></description>
			<content:encoded><![CDATA[<p>As part of the CLMA program here at University of Washington, we were asked to investigate an area of interest and write a short summary of what we read about.</p>
<p>So I chose to look at the TREC Question Answering challenge and read about some of the techniques that were used for submissions to that evaluation.  The kinds of questions that are used for this include simple factoid questions such as &#8220;How many calories are in a Big Mac?&#8221;, list questions such as &#8220;Which past and present NFL players have the last name of Johnson?”, and definition question such as &#8220;What is a Golden Parachute?&#8221;.</p>
<p>Many of the system descriptions submitted to TREC have pipelined architectures.  Here are some examples of components that were described:</p>
<ul>
<li>Question Type Classification &#8211; Decide what the answer should be like.  Is it asking for a name, a date, a description or a list, etc.</li>
<li>Question Rewrite &#8211; Convert the question to search terms that are used for the IR search.  This step typically expands the list of words used in the search by adding synonyms.  This gives a broader list of pages returned that can then be further analyzed.</li>
<li>IR Search &#8211; generally a simple search using established technologies.</li>
<li>Passage Selection &#8211; deciding which portions of the text returned by the IR search to include in the results.</li>
<li>Answer extraction and ranking &#8211; creating the actual text that will be returned as answers.</li>
</ul>
<p>The paper is not intended to be a thorough description of the field.  It helped me get a handle on what current designs are being developed.</p>
<p>Here is the link to my exploration paper: <a href="http://www.emergingcl.com/wp-content/uploads/2008/05/bobnew_trec_qa.pdf" title="QuestionAnsweringAtTrec">QuestionAnsweringAtTrec</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=82</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Entailment Revisited</title>
		<link>http://www.emergingcl.com/?p=79</link>
		<comments>http://www.emergingcl.com/?p=79#comments</comments>
		<pubDate>Mon, 04 Feb 2008 00:32:51 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Local Development]]></category>

		<guid isPermaLink="false">http://69.89.21.80/~emergin2/?p=79</guid>
		<description><![CDATA[In a previous entry, I wrote about the Entailment Challenge.  Entailment is a linguistic knowledge concept that concerns two sentences about an event or idea.  It is said that sentence A &#8216;entails&#8217; sentence B when all of the meaning in B is contained in A.  Follow this link to see more discussion: [...]]]></description>
			<content:encoded><![CDATA[<p>In a previous entry, I wrote about the Entailment Challenge.  Entailment is a linguistic knowledge concept that concerns two sentences about an event or idea.  It is said that sentence A &#8216;entails&#8217; sentence B when all of the meaning in B is contained in A.  Follow this link to see more discussion:  <a href="index.php?p=34">Recognizing Entailment.</a></p>
<p>As part of the Master&#8217;s program in Computational Linguistics at the University of Washington, we are preparing for internship appointments in the summer of 2008.  As part of that preparation, we were asked to pick a topic that concerns Computational Linguistics and write a short summary of a few papers that covered the topic.  This is a PDF of that paper.  <a href="http://69.89.21.80/%7Eemergin2/wp-content/uploads/2008/03/bobnew_preinternshiptopic.pdf" title="PreInternship Topic">PreInternship Topic</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=79</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Comparing Bernoulli and Multinomial</title>
		<link>http://www.emergingcl.com/?p=77</link>
		<comments>http://www.emergingcl.com/?p=77#comments</comments>
		<pubDate>Sun, 03 Feb 2008 22:10:01 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Local Development]]></category>

		<guid isPermaLink="false">http://69.89.21.80/~emergin2/?p=77</guid>
		<description><![CDATA[As part of our NLP statistical processing class in the CLMA program at the University of Washington, we did a comparison of Naive-Bayes learning algorithms.  We compared the Bernoulli and Multinomial approaches to this problem and the results are shown in the table below.
This test was run by &#8216;training&#8217; a classifier on a set [...]]]></description>
			<content:encoded><![CDATA[<p>As part of our NLP statistical processing class in the CLMA program at the University of Washington, we did a comparison of Naive-Bayes learning algorithms.  We compared the Bernoulli and Multinomial approaches to this problem and the results are shown in the table below.</p>
<p>This test was run by &#8216;training&#8217; a classifier on a set of data instances.  Each instance has a vector of features.  In the Bernoulli case, we treat the features as binary.  In the multinomial case, we treat the features as a numeric value from 0 to n where n is the number of instances the given word was found in the instance.  Both sets of data have a &#8216;class&#8217; assigned to them.  After we train on the data, we check our system results with the actual class value assigned to each instance.  This is where the percent accuracy comes from.</p>
<p>Of course these numbers are from a single test on one set of training and test data, but the fact that the Multinomial results are 91% accurate compared to 88% accurate for the Bernoulli is pretty telling.  Also you can see that the elapsed runtime for the test is significantly different as well.  The &#8216;cross_prob_delta&#8217; is a parameter for tweaking the &#8216;add-one&#8217; smoothing in the training stage.</p>
<blockquote><p>Bernoull</p></blockquote>
<blockquote>
<table cellspacing="0" cellpadding="0" border="1" style="border: medium none ; border-collapse: collapse" class="MsoTableGrid">
<tr>
<td valign="top" style="border: 1pt solid windowtext; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">Cross<br />
prob<br />
delta</td>
<td valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">Training Accuracy</p>
</td>
<td valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">Test Accuracy</p>
</td>
<td valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">Wall Clock (seconds)</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">0.1</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.9303</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.8800</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">303.23</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">0.5</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.9103</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.8633</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">318.10</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">1.0</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.8970</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.8400</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">305.45</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">2.0</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.8796</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.8233</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">305.56</p>
</td>
</tr>
</table>
<p class="MsoNormal">Multinomial</p>
</blockquote>
<blockquote>
<table cellspacing="0" cellpadding="0" border="1" style="border: medium none ; border-collapse: collapse" class="MsoTableGrid">
<tr>
<td valign="top" style="border: 1pt solid windowtext; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">Cross prob<br />
delta</td>
<td valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">Training Accuracy</p>
</td>
<td valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">Test Accuracy</p>
</td>
<td valign="top" style="border-style: solid solid solid none; border-color: windowtext windowtext windowtext -moz-use-text-color; border-width: 1pt 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">Wall Clock (seconds)</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">0.1</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.9570</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.9133</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">8.33</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">0.5</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.9503</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.9066</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">8.50</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">1.0</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.9448</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.9000</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">8.29</p>
</td>
</tr>
<tr>
<td valign="top" style="border-style: none solid solid; border-color: -moz-use-text-color windowtext windowtext; border-width: medium 1pt 1pt; padding: 0in 5.4pt; width: 95px">
<p class="MsoNormal">2.0</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 84px">
<p class="MsoNormal">0.9400</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 78px">
<p class="MsoNormal">0.8966</p>
</td>
<td valign="top" style="border-style: none solid solid none; border-color: -moz-use-text-color windowtext windowtext -moz-use-text-color; border-width: medium 1pt 1pt medium; padding: 0in 5.4pt; width: 90px">
<p class="MsoNormal">8.26</p>
</td>
</tr>
</table>
</blockquote>
<p>While we were working on this evaluation, we were brainstorming on other ways to compare the Bernoulli and Multinomial approaches.  This is what we came up with.</p>
<ul>
<li>Office chair comparison.  Put signs on two contestants that are sitting in office chairs.  The signs are &#8216;Bernoulli&#8217; and &#8216;Multinomial&#8217;.  The contestants race down the hallway without lifting themselves out of the chair.  The first sign to the finish line is the method of choice.</li>
<li>Date getting comparison.  Again, put signs on two contestants.  Have them stand back to back in the center of the student lounge and randomly ask girls for dates.  The sign that gets the most dates is the method of choice.</li>
</ul>
<p>All kidding aside, the NLP stats processing class is challenging and fun.  We are gaining insight into how these methods can be used for basic classification of data.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.emergingcl.com/?feed=rss2&amp;p=77</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
