Datasets


 

Add Your Site
     No more subcategories.


1.
The 20 Newsgroups Data Set  (site info)
20 Newsgroups for text categorization. Widely used dataset.
http://www.ai.mit.edu/~jrennie/20_newsgroups/
Preview by Thumbshots



2.
Penn Treebank Project  (site info)
A corpus of parsed sentences. Used by many researchers for training data-driven parsing algorithms.
http://www.cis.upenn.edu/~treebank/
Preview by Thumbshots



3.
Web->KB dataset  (site info)
Web pages partitioned into classes, with hyperlink data. The dataset has been used for text categorization and learning to extract symbolic knowledge from the World Wide Web.
http://www.cs.cmu.edu/afs/cs.cmu.edu/project/theo-...
Preview by Thumbshots



4.
Face recognition dataset  (site info)
A dataset of face images for face recognition algorithms.
http://www.cs.cmu.edu/afs/cs.cmu.edu/user/avrim/ww...
Preview by Thumbshots



5.
WordSimilarity-353 Test Collection  (site info)
Contains 353 English word pairs along with human-assigned similarity judgements.
http://www.cs.technion.ac.il/~gabr/resources/data/...
Preview by Thumbshots


1-5 of 23. Next »


Free thumbnail preview by Thumbshots.org

Copyright © 2002, 2004, 2006 CategoryWEB, LLC. All Rights Reserved. about CategoryWEB | Add Your Site