What is a corpus?

A corpus is a sample of written or spoken language that is systematically collected as an empirical basis for language research.
 
One definition of a corpus is given by the VOICE project:

"It's a structured collection of language data capturing spoken ELF interactions which provides a basis for such a description." (VOICE corpus; ELF stands for English as a Lingua Franca)