Data extraction
Information extracted from sources
Quantification and categorization of extracted data
Produced datasets