Links found while studying machine learning and data processing: places to download open-source data

Published: 2020-12-14 03:54:22 | Category: Big Data | Source: compiled from the web

US government data http://www.data.gov/


Movies Recommendation:

  • MovieLens - Movie Recommendation Data Sets http://www.grouplens.org/node/73

  • Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r

  • Jester - Movie Ratings Data Sets (Collaborative Filtering Dataset) http://www.ieor.berkeley.edu/~goldberg/jester-data/

  • Cornell University - Movie-review data for use in sentiment-analysis experiments http://www.cs.cornell.edu/people/pabo/movie-review-data/
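Rating datasets such as the ones listed above are commonly distributed as plain-text files with one rating per line. The sketch below assumes the tab-separated layout of the MovieLens 100K `u.data` file (user id, item id, rating, timestamp); other releases and other datasets use different delimiters and columns, so check the README that ships with each download. The sample rows and the function names `load_ratings` and `mean_rating_per_item` are made up for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample in the assumed layout: user id, item id, rating, timestamp.
# The values are invented; a real file would be opened with open(path, newline="").
SAMPLE_RATINGS = "1\t10\t4\t881250949\n1\t20\t3\t881250950\n2\t10\t5\t881250951\n"


def load_ratings(handle):
    """Read tab-separated rows into a nested dict: user -> item -> rating."""
    ratings = defaultdict(dict)
    for user, item, rating, _timestamp in csv.reader(handle, delimiter="\t"):
        ratings[int(user)][int(item)] = float(rating)
    return dict(ratings)


def mean_rating_per_item(ratings):
    """Average the ratings each item received across all users."""
    totals = defaultdict(lambda: [0.0, 0])  # item -> [sum of ratings, count]
    for items in ratings.values():
        for item, rating in items.items():
            totals[item][0] += rating
            totals[item][1] += 1
    return {item: total / count for item, (total, count) in totals.items()}


ratings = load_ratings(io.StringIO(SAMPLE_RATINGS))
item_means = mean_rating_per_item(ratings)
```

The nested dictionary is only one possible representation; for large files a sparse matrix or a dataframe is usually a better fit.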

Music Recommendation:

  • Last.fm - Music Recommendation Data Sets http://www.dtic.upf.edu/~ocelma/MusicRecommendationDataset/index.html

  • Yahoo! - Movie, Music, and Images Ratings Data Sets http://webscope.sandbox.yahoo.com/catalog.php?datatype=r

  • Audioscrobbler - Music Recommendation Data Sets http://www-etud.iro.umontreal.ca/~bergstrj/audioscrobbler_data.html

  • Amazon - Audio CD recommendations http://131.193.40.52/data/

Books Recommendation:

  • Institut für Informatik, Universität Freiburg - Book Ratings Data Sets http://www.informatik.uni-freiburg.de/~cziegler/BX/

Food Recommendation:

  • Chicago Entree - Food Ratings Data Sets http://archive.ics.uci.edu/ml/datasets/Entree+Chicago+Recommendation+Data

Merchandise Recommendation:

  • Amazon - Product Recommendation Data Sets http://131.193.40.52/data/

Healthcare Recommendation:

  • Nursing Home - Provider Ratings Data Set http://data.medicare.gov/dataset/Nursing-Home-Compare-Provider-Ratings/mufm-vy8d

  • Hospital Ratings - Survey of Patients Hospital Experiences http://data.medicare.gov/dataset/Survey-of-Patients-Hospital-Experiences-HCAHPS-/rj76-22dk

Dating Recommendation:

  • www.libimseti.cz - Dating website recommendation (collaborative filtering) http://www.occamslab.com/petricek/data/

Scholarly Paper Recommendation:

  • National University of Singapore - Scholarly Paper Recommendation http://www.comp.nus.edu.sg/~sugiyama/SchPaperRecData.html


Information Network

  • DBLP http://www.informatik.uni-trier.de/~ley/db/

  • proximity DBLP http://kdl.cs.umass.edu/data/dblp/dblp-info.html

  • DBLP-Citation-Network http://arnetminer.org/citation

  • KDD-2011 http://www.cs.uiuc.edu/~hbdeng/data/kdd2011.htm

  • CiteSeer (hardly) http://csxstatic.ist.psu.edu/about/data

  • CiteSeer dumped http://martinharrigan.blogspot.com/2008/07/citeseers-dataset.html

  • Cora (hardly) http://people.cs.umass.edu/~mccallum/data.html

  • IMDB http://www.imdb.com/interfaces/

Social Network

  • Stanford large network dataset collection (contains many network datasets): http://snap.stanford.edu/data/

  • Stanford class resources http://snap.stanford.edu/na09/resources.html

  • ICWSM Twitter dataset: http://twitter.mpi-sws.org/data-icwsm2010.html

  • EBSN - Event-based social network dataset: http://www.largenetwork.org/ebsn

  • Other social network datasets: Slashdot, Enron email, MIT mobile, Epinions reviews.
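Many of the network datasets above, including a large share of the Stanford collection, are plain-text edge lists. The sketch below assumes that common layout: lines starting with `#` are comments and every other line holds a source node id and a target node id separated by whitespace. The sample graph and the helper names `read_edge_list` and `in_degrees` are hypothetical; individual datasets may add columns such as signs, weights, or timestamps.

```python
from collections import defaultdict

# Hypothetical edge list in the assumed layout; the graph itself is made up.
SAMPLE_EDGES = """# Directed graph (made-up example)
# FromNodeId\tToNodeId
0\t1
0\t2
1\t2
2\t0
"""


def read_edge_list(text):
    """Parse a directed edge list into a dict: node -> set of successor nodes."""
    adjacency = defaultdict(set)
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        source, target = line.split()[:2]
        adjacency[int(source)].add(int(target))
        adjacency.setdefault(int(target), set())  # keep nodes with no out-edges
    return dict(adjacency)


def in_degrees(adjacency):
    """Count incoming edges for every node in the adjacency dict."""
    counts = {node: 0 for node in adjacency}
    for targets in adjacency.values():
        for target in targets:
            counts[target] += 1
    return counts


graph = read_edge_list(SAMPLE_EDGES)
```

Calling `in_degrees(graph)` then gives a quick sanity check on a freshly downloaded file before handing it to a graph library.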

Sentiment and Opinion Mining

  • MPQA http://www.cs.pitt.edu/mpqa/index.html

  • Bing Liu's homepage

  • Movie Review http://www.cs.cornell.edu/people/pabo/movie-review-data/

  • Lee's homepage

  • Twitter sentiment: http://www.sananalytics.com/lab/twitter-sentiment/

Recommendation

  • index1: https://gist.github.com/1653794

  • index2: http://mobblog.cs.ucl.ac.uk/datasets/

Machine Learning

  • UCI dataset http://archive.ics.uci.edu/ml/datasets.html

Audio Retrieval

  • CAL-500: http://twitterdata.org/

  • Million song dataset http://labrosa.ee.columbia.edu/millionsong/

Miscellaneous1

  • Many graph datasets, including several cups, Twitter, etc. http://graphlab.org/downloads/datasets/

  • Several graph datasets http://law.di.unimi.it/datasets.php

  • Delicious/Flickr/Last.FM, etc. http://www.tagora-project.eu/data/

  • A small dataset about links http://www.cs.umd.edu/projects/linqs/projects/lbc/index.html

  • A small dataset including CiteSeerX/IMDB http://komarix.org/ac/ds/

Miscellaneous2

Only user-object

  • Amazon

Both user-user and user-object

Single-type user network

  • Flickr, YouTube, Twitter

Signed user network

  • Epinions, Slashdot, Ciao

Multi-type user network

  • Facebook, Google Plus
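The grouping above can be read as a description of which relations a dataset provides: user-object interactions only, or user-object interactions plus user-user links that are unlabeled, signed, or of several kinds. The sketch below is one hypothetical way to hold such data in memory and tell the cases apart. The class `InteractionData`, the function `describe`, and the label conventions (+1/-1 for signed edges, strings for relation types) are assumptions for illustration, not a format any of the listed sites prescribes.

```python
from dataclasses import dataclass, field


@dataclass
class InteractionData:
    # user -> object -> rating (the "user-object" part)
    user_object: dict = field(default_factory=dict)
    # (user, user) -> edge label; assumed to be +1/-1 for signed networks
    # or a relation name such as "friend" for typed networks
    user_user: dict = field(default_factory=dict)


def describe(data):
    """Name the category from the list above that the data falls into."""
    if not data.user_user:
        return "only user-object"
    labels = set(data.user_user.values())
    if labels <= {1, -1}:
        return "signed user network"
    if len(labels) == 1:
        return "single-type user network"
    return "multi-type user network"


# Made-up examples, one per category.
ratings_only = InteractionData(user_object={"u1": {"book": 5}})
follows = InteractionData(user_user={("u1", "u2"): "follows"})
trust = InteractionData(user_user={("u1", "u2"): 1, ("u2", "u3"): -1})
circles = InteractionData(user_user={("u1", "u2"): "friend", ("u1", "u3"): "colleague"})
```

Here `describe(trust)` returns "signed user network" and `describe(circles)` returns "multi-type user network"; a real loader would fill both dictionaries from the downloaded files.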

(Editor: 李大同)
