ImageNet is an ongoing research effort to provide researchers around the world an easily accessible image database.
THE MNIST DATABASE of handwritten digits.
Biological Science Database
StringDB database for protein
Ensembl.org genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation.
NCBI The National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information.
BioGRID Database of Protein, Chemical, and Genetic Interactions
Amazon product data This dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014.
Movie Review Data movie-review data for use in sentiment-analysis experiments.
Large Movie Review Dataset a set of 25,000 highly polar movie reviews for training, and 25,000 for testing.
MovieLens 20 million ratings and 465,000 tag applications applied to 27,000 movies by 138,000 users. Includes tag genome data with 12 million relevance scores across 1,100 tags.
Amazon movie reviews The data span a period of more than 10 years, including all ~8 million reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review.