estimating information about an item (e.g. languages spoken) based on language used in a published work