automated process of selecting and analyzing large amounts of text or data resources for purposes such as searching and semantic analysis