We report on a progressing work for compiling Quora question answer dataset. Quora dataset is composed of the questions which are posed in Quora question answering site.
It is the only dataset which provides answers in sentence level and word level at the same time. Moreover, the questions in the dataset are authentic which is much more realistic for question answering systems.
We test the performance of a state-of-the-art question answering system on the dataset and compare it with human performance to establish an upper bound for the dataset