From: ericzen (ericzen@ez-net.com)
Date: Fri Jul 18 2003 - 13:16:18 EDT
On 2003.07.18 02:38 Andrew Dunbar wrote:
>
> Just be very sure that only non-copyrighted material
> is included, or that permission is granted by the
> copyright owners.
>
> Anyway at the risk of overstating, we don't so much
> want the most common words as we want the functional
> words.
Why would it matter?
I'll put eighty bucks down that Michael Crichton doesn't care if his "to"s are in the dictionary or if Wikipedia's "to"s are in the dictionary. Granted, if "Jurassic" shows up in the dictionary (like your second point), we might have more serious issues....
I, personally, would like to think that the out put will be hand-eyeballed, which would prevent the likes of "time" from entering the dictionary (relatively common) and ensure that "this" is present.
Wingerdy dingerdy
Eric
This archive was generated by hypermail 2.1.4 : Fri Jul 18 2003 - 13:26:47 EDT