Image Datasets

  • ImageNet is an ongoing research effort to provide researchers around the world an easily accessible image database.
  • THE MNIST DATABASE of handwritten digits.
  • Review Dataset

  • Amazon product data This dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014.
  • Movie Review Data movie-review data for use in sentiment-analysis experiments.
  • Large Movie Review Dataset a set of 25,000 highly polar movie reviews for training, and 25,000 for testing.
  • MovieLens 20 million ratings and 465,000 tag applications applied to 27,000 movies by 138,000 users. Includes tag genome data with 12 million relevance scores across 1,100 tags.
  • Amazon movie reviews The data span a period of more than 10 years, including all ~8 million reviews up to October 2012. Reviews include product and user information, ratings, and a plaintext review.
  • Biological Science Database

  • StringDB database for protein
  • Ensembl.org genomes that supports research in comparative genomics, evolution, sequence variation and transcriptional regulation.
  • NCBI The National Center for Biotechnology Information advances science and health by providing access to biomedical and genomic information.
  • BioGRID Database of Protein, Chemical, and Genetic Interactions