English News Text Treebank: Penn Treebank Revised

Resource
URL
https://dss2.princeton.edu/data/301/
Blurb

English News Text Treebank: Penn Treebank Revised was developed by the Linguistic Data Consortium (LDC) with funding through a gift from Google Inc. It combines automated and manual revisions of the Penn Treebank annotation of Wall Street Journal (WSJ) stories. The data comprise 1,203,648 word-level tokens in 49,191 sentence-level tokens, spanning all 2,312 of the original Penn Treebank WSJ files.

Link time
2020-05-21 19:25:00 UTC
Resource type
Single study
Subjects
  • Art & Culture
Regions
    Countries
    • United States