Resource: New York Times Annotated Corpus

Data Files

Persistent Identifier (PID) of this digital object: https://hdl.handle.net/11022/0000-0000-2CC1-5
Resource landing page: https://hdl.handle.net/11022/0000-0000-2CC1-5
Packaged files for this dataset:

This data set contains the following subordinate data objects:

This data set contains the following data streams:

General Information

Resource Name: New York Times Annotated Corpus
Resource Title: New York Times Annotated Corpus
Resource Class: Corpus
Version: archived
Life Cycle Status:
Start Year:
Completion Year:
Publication Date: 2008
Last Update:
Time Coverage:
Legal Owner:
Genre: unspecified
Field of Research:
Location: Deutschland Germany
Description:

English: The New York Times Annotated Corpus contains over 1.8 million articles written and published by the New York Times between January 1, 1987 and June 19, 2007 with article metadata provided by the New York Times Newsroom, the New York Times Indexing Service and the online production staff at nytimes.com.

Tags:
Modality Info: written

Project

Project Name: SFB 833 INF
Project Title: Heterogene Forschungsprimärdaten des SFB 833 – Repräsentation und Verarbeitung Heterogenous Primary Research Data of the SFB 833 - Representation and Processing
Project ID: 75650358
Url: https://www.sfb833.uni-tuebingen.de/infrastrukturprojekt-inf.html?type=0
Funder: Deutsche Forschungsgemeinschaft (DFG)
Institution: Sonderforschungsbereich 833: Bedeutungskonstitution- Dynamik und Adaptivität sprachlicher Strukturen SFB 833: The construction of meaning - the dynamics and adaptivity of linguistic structures Other: Eberhard Karls Universität Tübingen

Cooperations:
Person(s): Erhard Hinrichs
Descriptions:
Duration:

Creation

Topic:
Creator(s):
Source:
Original Source New York Times (Newswire)
Catalogue Link:
Type:
Format:
Size:
Quality:
Description:
Derivation:
Organisation(s) Linguistic Data Consortium
Derivation Date
Derivation Mode(s)
Derivation Type(s) Information extraction, Information retrieval, Metadata extraction, Summarization
Derivation Workflow(s)
Derivation Tool Info

Documentations

Documentation Type(s):
File Name(s):
URL:
Documentation Language(s): English
Descriptions(s):

Text Corpus

Corpus Type: specialised corpus
Temporal Classification:
Description(s):
Validation:
Subject Language(s):
Type-specific Size Info:

Access

Availability: request required
Distribution Medium: Download One DVD
Catalogue Link:
Price:
Licence: Linguistic Data Consortium, The Trustees of the University of Pennsylvania.
Contact: Thorsten Trippel (Archive Manager) , e-mail:thorsten.trippel@uni-tuebingen.de
Deployment Tool Info:
Descriptions:

This digital object contains:
Original File Name Size Checksums
NYTAnnotatedCorpus.zip 3233831237B
  • 2b7685aab3557dd88ae18e480a41ff93070036fe (SHA1)

About

Generation: Automatically generated with an XSL stylesheet from the CMDI file, v.02
Contact: Thorsten Trippel and Claus Zinn, SfS Tuebingen

(): New York Times Annotated Corpus

Persistent identifier: https://hdl.handle.net/11022/0000-0000-2CC1-5

This resource is provided through the technology partnership with the Tübingen Archive of Language Resources