On Tue, Dec 02, 2003 at 11:25:41AM +0000, Jack Bertram wrote: > * David Yitzchak Cohen <lists+mutt_users@xxxxxxxxxxxxxx> [031202 11:16]: > > I prefer to use a heuristic (e.g. spell checking) to determine > > what the "primary" language of the original mail was and/or to determine > > what the "primary" language of my own email is. Either way, the test > > itself would be abstracted into something like a what_language_is_this > > script, which you could replace with anything you want (including a > > simple test of the charset). > > You could probably and profitably check out ifile, which is a general > Bayesian text classifer by Jason Rennie. Some people on the ifile mailing > list have reported success at using it to distinguish between emails > in different languages. Well, that'd certainly be another option for the what_language_is_this script ... or we can have each of these algorithms "vote" for a language, and use some voting algorithm to decide which language to assume. I really lack the time to setup such a system, though. Thanks for the suggestion, anyway, though, - Dave -- Uncle Cosmo, why do they call this a word processor? It's simple, Skyler. You've seen what food processors do to food, right? Please visit this link: http://rotter.net/israel
Attachment:
pgpKB8QsIyV3u.pgp
Description: PGP signature