Data sampling Data Engineering
noun phrase
Definition: The process of selecting a subset of data from a larger dataset for analysis, model development, evaluation, or more efficient processing. It is commonly used as an umbrella term for methods that reduce, balance, or structure data before or during computational analysis [IBM].
Example in context: “An exciting direction for future research could be to develop dynamic policies for data sampling that automatically adapt to diverse applications.” [Akujuobi et al. 2024]
Synonym: sampling
Related terms: random sampling, stratified sampling, subsampling
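Illustration: The sketch below shows two of the related terms in practice, simple random sampling and stratified sampling, using only the Python standard library. It is a minimal example rather than a reference implementation. The function names (random_sample, stratified_sample), the toy dataset, the fixed seed, and the choice to keep at least one record per group are all assumptions made for illustration and are not drawn from the cited sources.

```python
import random
from collections import defaultdict


def random_sample(records, k, seed=0):
    """Simple random sampling: draw k records without replacement."""
    rng = random.Random(seed)  # seeded only to make the example reproducible
    return rng.sample(records, k)


def stratified_sample(records, key, fraction, seed=0):
    """Stratified sampling: draw the same fraction from every group."""
    rng = random.Random(seed)

    # Partition the records into strata according to the grouping key.
    groups = defaultdict(list)
    for rec in records:
        groups[key(rec)].append(rec)

    sample = []
    for label in sorted(groups):
        members = groups[label]
        # Assumption: keep at least one record from each stratum.
        n = max(1, round(len(members) * fraction))
        sample.extend(rng.sample(members, n))
    return sample


# Hypothetical toy dataset: 90 records labelled "a" and 10 labelled "b".
dataset = [{"id": i, "label": "a" if i < 90 else "b"} for i in range(100)]

subset = random_sample(dataset, 10)
balanced = stratified_sample(dataset, key=lambda r: r["label"], fraction=0.2)
```

With simple random sampling, the rare label "b" may be missing from a small subset entirely; the stratified variant draws from each label separately, so both groups appear in the result in proportion to their size.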