a set of (pre-)processing techniques that fit the data to the knowledge representation required by the intelligent data analysis techniques to be applied. These techniques take data that are already complete and correct and adapt them to a representation that may accept only numerical or nominal data, require values to fall within certain intervals or match certain text patterns, or require a particular structure (graph, database table, facts), etc.
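As a rough illustration of the kinds of fitting described above, the following Python sketch checks a text pattern, encodes a nominal attribute as integers, rescales a numerical attribute into an interval, and arranges the result as table rows. The records, field names, ID pattern and helper functions (`encode_nominal`, `rescale`, `to_table`) are all hypothetical and chosen only for this example; real pipelines would likely choose encodings and ranges to suit the target analysis technique.

```python
import re

# Hypothetical, already complete and correct records (illustrative only).
records = [
    {"id": "A-001", "colour": "red", "height_cm": 150.0},
    {"id": "A-002", "colour": "blue", "height_cm": 175.0},
    {"id": "A-003", "colour": "red", "height_cm": 200.0},
]

# Assumed text pattern the target representation expects for identifiers.
ID_PATTERN = re.compile(r"[A-Z]-\d{3}")


def encode_nominal(values):
    """Map each distinct nominal value to an integer code (sorted for determinism)."""
    codes = {v: i for i, v in enumerate(sorted(set(values)))}
    return [codes[v] for v in values], codes


def rescale(values, low=0.0, high=1.0):
    """Min-max rescale numerical values into the interval [low, high]."""
    vmin, vmax = min(values), max(values)
    if vmax == vmin:
        # All values identical: map everything to the lower bound.
        return [low for _ in values]
    span = vmax - vmin
    return [low + (v - vmin) * (high - low) / span for v in values]


def to_table(recs):
    """Fit records to a purely numerical table: rows of (id, colour_code, height)."""
    for r in recs:
        if not ID_PATTERN.fullmatch(r["id"]):
            raise ValueError(f"id does not match expected pattern: {r['id']!r}")
    colour_codes, mapping = encode_nominal([r["colour"] for r in recs])
    heights = rescale([r["height_cm"] for r in recs])
    rows = [(r["id"], c, h) for r, c, h in zip(recs, colour_codes, heights)]
    return rows, mapping


rows, mapping = to_table(records)
for row in rows:
    print(row)
```

Integer codes impose an artificial order on nominal values, so a technique that is sensitive to order may be better served by a different encoding, such as one indicator column per value.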