This paper addresses a problem of knowledge discovery in big data from the point of view of theoretical computer science. Contemporary characterization of big data is often preoccupied by its volume, velocity of change, and variety that causes technical difficulties to handle the data efficiently while theoretical challenges that are offered by big data are neglected at the same time.