process of separating and organizing data into relevant groups ("classes") based on their shared characteristics
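
As a rough illustration of the definition above, the sketch below groups a handful of records into classes by one shared characteristic. The function name `classify`, the sample `records`, and the choice of `color` as the characteristic are all hypothetical, chosen only for this example. Real classification schemes may use several characteristics or learned rules.

```python
from collections import defaultdict


def classify(records, characteristic):
    """Group records into classes keyed by the value of a shared characteristic.

    `characteristic` is a function that returns the class label for one record.
    """
    classes = defaultdict(list)
    for record in records:
        label = characteristic(record)
        classes[label].append(record)
    return dict(classes)


# Hypothetical sample data: each record has a name and a color.
records = [
    {"name": "apple", "color": "red"},
    {"name": "banana", "color": "yellow"},
    {"name": "cherry", "color": "red"},
    {"name": "lemon", "color": "yellow"},
    {"name": "lime", "color": "green"},
]

# Separate the records into classes by their shared color.
by_color = classify(records, lambda r: r["color"])

for label, members in by_color.items():
    print(label, [m["name"] for m in members])
```

Here every record lands in exactly one class, and records in the same class share the same value of the chosen characteristic.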