Список литературы:
- Cafarella M. J., Etzioni O. A Search Engine for Natural Language Applications // WWW Conference. 2005. P. 442-452.
- Chakrabarti S., Punera K., Subramanyam M. Accelerated Focused Crawling through Online Relevance Feedback // Ibidem. 2002.
- Chaudhuri S., Ganjam K., Ganti V., Motwani R. Robust and Efficient Fuzzy Match for Online Data Cleaning // SIGMOD. 2003.
- Cho J., Rajagopalan S. A Fast Regular Expression Indexing Engine // ICDE. 2002. P. 419-430.
- DeRose P., Shen W., F. C. 0002, Lee Y., Burdick D., Doan A., Ramakrishnan R. DBLife: a Community Information Management Platform for the Database Research Community (Demo) // CIDR. 2007. P. 169-172.
- Grishman R., Huttunen S., Yangarber R. Information Extraction for Enhanced Access to Disease Outbreak Reports // Journal of Biomedical Informatics. 2002. Vol. 35. P. 236-246.
- http://ru.wikipedia.org/wiki/TF-IDF
- Ipeirotis P. G., Agichtein E., Jain P., Gravano L. Towards a Query Optimizer for Text-Centric Tasks // ACM Transactions on Database Systems. 2007. Vol. 32.
- Kim M.-S., Whang K.-Y., Lee J.-G., Lee M.-J. N-Gram/2l: a Space and Time Efficient Two-Level N-Gram Inverted Index Structure // VLDB'05: Proceedings of the 31st International Conference on Very Large Data Bases. 2005. P. 325-336.
- Resnik P., Elkiss A. The Linguist's Search Engine: an Overview (Demonstration) // ACL. 2005.
|