Wednesday, June 13, 2012

Berkeley language model and Google Web 1T language model

Berkeley language model provides a library for estimating storing large n-gram language models in memory and accessing them efficiently. The most amazing contribution of it is that it can be used with the Google Web 1T language model, and it also provides the binary Web 1T language models for many languages:
English
Chinese
Czech
Dutch
Frenchh
German
Italian
Polish
Portuguese
Romanian
Spanish
Swedish

The homepage of the Berkeley language model project is here, and you can find the binary language models of the Google Web 1T here.

No comments: