<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: How to detect which language a text is written in? Or when science meets human!</title>
	<atom:link href="http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/feed/" rel="self" type="application/rss+xml" />
	<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/</link>
	<description></description>
	<lastBuildDate>Sun, 22 Nov 2009 23:35:16 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: freelancer</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1819</link>
		<dc:creator>freelancer</dc:creator>
		<pubDate>Mon, 27 Apr 2009 14:29:57 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1819</guid>
		<description>This n-gram based tool written in PHP detects a site language depending on its content (for a user specified URL)

http://www.site-language.com

Sometimes have problems with non utf-8 encodings, but mostly works ok.</description>
		<content:encoded><![CDATA[<p>This n-gram based tool written in PHP detects a site language depending on its content (for a user specified URL)</p>
<p><a href="http://www.site-language.com" rel="nofollow">http://www.site-language.com</a></p>
<p>Sometimes have problems with non utf-8 encodings, but mostly works ok.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: feedbackmine</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1815</link>
		<dc:creator>feedbackmine</dc:creator>
		<pubDate>Thu, 09 Apr 2009 05:17:52 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1815</guid>
		<description>Here is my n-gram based language detector, written in ruby:

http://github.com/feedbackmine/language_detector/tree/master

I have been using it in production and it performs very well.</description>
		<content:encoded><![CDATA[<p>Here is my n-gram based language detector, written in ruby:</p>
<p><a href="http://github.com/feedbackmine/language_detector/tree/master" rel="nofollow">http://github.com/feedbackmine/language_detector/tree/master</a></p>
<p>I have been using it in production and it performs very well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Check string language &#124; keyongtech</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1806</link>
		<dc:creator>Check string language &#124; keyongtech</dc:creator>
		<pubDate>Sun, 18 Jan 2009 17:07:58 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1806</guid>
		<description>[...] Check string language     Also : http://tnlessone.wordpress.com/2007/...e-meets-human/  and more generally : http://www.google.fr/search?hl=fr&amp;q=...+natural&amp;meta=  -- Patrice  [...]</description>
		<content:encoded><![CDATA[<p>[...] Check string language     Also : <a href="http://tnlessone.wordpress.com/2007/...e-meets-human/" rel="nofollow">http://tnlessone.wordpress.com/2007/&#8230;e-meets-human/</a>  and more generally : <a href="http://www.google.fr/search?hl=fr&amp;q=...+natural&amp;meta=" rel="nofollow">http://www.google.fr/search?hl=fr&amp;q=&#8230;+natural&amp;meta=</a>  &#8212; Patrice  [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Neotenic</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1792</link>
		<dc:creator>Neotenic</dc:creator>
		<pubDate>Fri, 11 Jul 2008 23:07:01 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1792</guid>
		<description>I have made a language detector. It’s available here:
&lt;a href=&quot;http://www.anticulture.net&quot; rel=&quot;nofollow&quot;&gt;Language Detector&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>I have made a language detector. It’s available here:<br />
<a href="http://www.anticulture.net" rel="nofollow">Language Detector</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Neotenic</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1791</link>
		<dc:creator>Neotenic</dc:creator>
		<pubDate>Fri, 11 Jul 2008 23:03:39 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1791</guid>
		<description>I have made a language detector. It&#039;s available here:
&lt;a href=&quot;http://www.anticulture.net/LanguageDetector.php&quot; rel=&quot;nofollow&quot;&gt;Language Detector&lt;/a&gt;</description>
		<content:encoded><![CDATA[<p>I have made a language detector. It&#8217;s available here:<br />
<a href="http://www.anticulture.net/LanguageDetector.php" rel="nofollow">Language Detector</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: My daily readings 03/06/2008 &#171; Strange Kite</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1771</link>
		<dc:creator>My daily readings 03/06/2008 &#171; Strange Kite</dc:creator>
		<pubDate>Thu, 06 Mar 2008 11:50:14 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1771</guid>
		<description>[...] How to detect which language a text is written in? Or when science meets human! « The Nameless One [...]</description>
		<content:encoded><![CDATA[<p>[...] How to detect which language a text is written in? Or when science meets human! « The Nameless One [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: test 03/05/2008 &#171; Strange Kite</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1770</link>
		<dc:creator>test 03/05/2008 &#171; Strange Kite</dc:creator>
		<pubDate>Wed, 05 Mar 2008 17:50:16 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1770</guid>
		<description>[...] How to detect which language a text is written in? Or when science meets human! « The Nameless One [...]</description>
		<content:encoded><![CDATA[<p>[...] How to detect which language a text is written in? Or when science meets human! « The Nameless One [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Morticia</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1727</link>
		<dc:creator>Morticia</dc:creator>
		<pubDate>Tue, 27 Nov 2007 23:04:00 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-1727</guid>
		<description>I bought a scarf the other day and it has some writing on it. I&#039;m really curious as to what it says because I Don&#039;t want to wear it if its some sort of satanic rambling. By looking at the print on it you&#039;d say it was turkish or iranian or sopmething but I tried to translate from a few languages and still couldn&#039;t find out what it means. 
How can I go about finding out the meaning of this phrase?

By the way the writing reads &#039;dariguguseman tujuhjemaatyalomsawah l&#039;</description>
		<content:encoded><![CDATA[<p>I bought a scarf the other day and it has some writing on it. I&#8217;m really curious as to what it says because I Don&#8217;t want to wear it if its some sort of satanic rambling. By looking at the print on it you&#8217;d say it was turkish or iranian or sopmething but I tried to translate from a few languages and still couldn&#8217;t find out what it means.<br />
How can I go about finding out the meaning of this phrase?</p>
<p>By the way the writing reads &#8216;dariguguseman tujuhjemaatyalomsawah l&#8217;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: tnlessone</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-169</link>
		<dc:creator>tnlessone</dc:creator>
		<pubDate>Wed, 16 May 2007 08:18:56 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-169</guid>
		<description>Hi reto!
Thanks for sharing your experience! It&#039;s always interesting to see the domains language detection can be applied to and the solutions chosen.
And sorry for the comment blocking... It seems as if Askimet is very restrictive at... spam-freeing comments.
It even blocks mine...</description>
		<content:encoded><![CDATA[<p>Hi reto!<br />
Thanks for sharing your experience! It&#8217;s always interesting to see the domains language detection can be applied to and the solutions chosen.<br />
And sorry for the comment blocking&#8230; It seems as if Askimet is very restrictive at&#8230; spam-freeing comments.<br />
It even blocks mine&#8230;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: reto</title>
		<link>http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-165</link>
		<dc:creator>reto</dc:creator>
		<pubDate>Tue, 15 May 2007 23:31:53 +0000</pubDate>
		<guid isPermaLink="false">http://tnlessone.wordpress.com/2007/05/13/how-to-detect-which-language-a-text-is-written-in-or-when-science-meets-human/#comment-165</guid>
		<description>Your comments are very much appreciated!
I myself started to ponder over language-detection using n-grams, when I wondered how DivX subtitle that can be uploaded to 

http://www.opensubtitles.org/en


are detected and how the range of detectable languages could be increased.

I found a PERL version by Gertjan van Noord&#039;s TextCat 
http://www.let.rug.nl/~vannoord/TextCat/

and a PYTHON version by Thomas Mangin.
http://thomas.mangin.me.uk/software/python.html

but I had difficulties to encounter a PHP version...and that&#039;s why I did it myself at
http://boxoffice.ch/pseudo/index.php

Since I&#039;m definitely not experienced in writing any code I was surprised that three small functions together with the finger-prints of the languages do the job. There was no big issue about Asian (unicode) language-detection, since the php string funtiction go obiously bytewise (unless not told differently).

I put the code, demo, finger-prints and more on my website as well. It might be instructive for some.</description>
		<content:encoded><![CDATA[<p>Your comments are very much appreciated!<br />
I myself started to ponder over language-detection using n-grams, when I wondered how DivX subtitle that can be uploaded to </p>
<p><a href="http://www.opensubtitles.org/en" rel="nofollow">http://www.opensubtitles.org/en</a></p>
<p>are detected and how the range of detectable languages could be increased.</p>
<p>I found a PERL version by Gertjan van Noord&#8217;s TextCat<br />
<a href="http://www.let.rug.nl/~vannoord/TextCat/" rel="nofollow">http://www.let.rug.nl/~vannoord/TextCat/</a></p>
<p>and a PYTHON version by Thomas Mangin.<br />
<a href="http://thomas.mangin.me.uk/software/python.html" rel="nofollow">http://thomas.mangin.me.uk/software/python.html</a></p>
<p>but I had difficulties to encounter a PHP version&#8230;and that&#8217;s why I did it myself at<br />
<a href="http://boxoffice.ch/pseudo/index.php" rel="nofollow">http://boxoffice.ch/pseudo/index.php</a></p>
<p>Since I&#8217;m definitely not experienced in writing any code I was surprised that three small functions together with the finger-prints of the languages do the job. There was no big issue about Asian (unicode) language-detection, since the php string funtiction go obiously bytewise (unless not told differently).</p>
<p>I put the code, demo, finger-prints and more on my website as well. It might be instructive for some.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
