About
The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by following this link to the mailing list. The tar.gz format is a tarred and gzipped file containing triples given in the N-Triples syntax.
Data exposed: extracted from Temis software applied to 7% of Medline records Size of dump and data set: 24 MB Notes: released without contract
Openness
Data is comprised of other datasets - most of which are open.
Resources
http://purl.org/hcls/2007/kb-sources/neurocommons-text-mining.tgz [downloaded 2 times]
Example resource
http://sw.neurocommons.org/2007/annotations#gene-or-gene-product
Additional Information
| Field | Value |
|---|---|
| Source | http://sw.neurocommons.org/2007/text-mining.html |
| Author | <URI> |
| Maintainer | Maintainer not given |