There are 102 datasets tagged with linkeddata:
-
Data exposed: GO annotations from National Center for Biotechnology Information (NCBI) and European Bioinformatics Institute (EBI) Size of dump and data set: 73 MB Openness Derived...
-
Data exposed: FlyAtlas and Affy D2 probe-to-gene Size of dump and data set: size? Notes: also found in the list of SPARQL Endpoints
-
About Data exposed: provides daily generated dumps with all its DOAP project descriptions Size of dump and data set: size? Notes: 2009-05-24: Both files seem to be empty - hg...
-
Data exposed: what? Size of dump and data set: 626 KB Notes: NCBI Copyright and Disclaimers
-
Data exposed: DMOZ Size of dump and data set: size? Openness: OPEN (?) Uses the Open Directory License, which is, in essence, open (there may be some wrinkles about updates).
-
Data exposed: collaborative file describing service Size of dump and data set: 330,026 discrete files, 270MB uncompressed
-
Data exposed: — Size of dump and data set: size? Notes: this is the classic RDF source but historically has had some problems with RDF correctness.
-
Data exposed: at least 42,000 famous quotations with author and subject Size of dump and data set: size?
-
Duplicate of package:freebase Data exposed: Freebase Views of Freebase Topics following the principles of Linked Data. The dataset extractions contain aggregated data from: Wikipedia,...
-
Status Note: the data does not appear to have been updated since March 2006. About From website: Wikipedia³ is a conversion of the English Wikipedia into RDF. It's a monthly updated...
-
Data exposed: derived from data published by www.fly-ted.org and provides metadata on images depicting in situ hybridisation in D. melanogaster testes. Size of dump and data set: size?...
-
Duplicate of package:twi-logd
-
About Data exposed: NLM 2007 MeSH Size of dump and data set: 13 MB Notes: MeSH MOU Openness Appears to be in public domain. Copyright page states: Government information at NLM...
-
About Data exposed: various data sets including CIA's World Factbook, Library of Congress' Thesaurus of Graphic Materials, National Cancer Institute's cancer thesaurus, Web Consortium's...
-
Data exposed: 290344 restaurants - 104856 reviews - 59243 links to reviews - 2402 editors Size of dump and data set: size? Openness: OPEN Available under Open Directory License.
-
About Data exposed: BAMS Size of dump and data set: 5.6 MB Notes: 2009-05-24: File does not exist - hg / Health Care and Life Sciences Interest Group (HCLSIG) / National Institute...
-
Data exposed: Entrez Gene Extract from [ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz] Size of dump and data set: 5.6 MB Notes: NCBI Copyright and Disclaimers
-
in.fondo.al.mar (under the sea) is an info-visualisation project about a series of sinkings and incidents in the Mediterranean Sea, involving ships which are suspected of having carried...
-
Data exposed: Galen from co-ode.org Size of dump and data set: 1.9 MB Notes: released without contract Openness: ? No license specified on home page though generic (restrictive)...
-
Data exposed: metadata extracted from Wikipedia Size of dump and data set: 47 million triples
-
Data exposed: data exposed? Size of dump and data set: expands to about 1.3GB
-
This dataset consists solely of triples linking items in one Linked Data dataset with items in another. The triples are generated as part of the LATC project, using the SILK Linking...
-
Data exposed: Addgene catalog (tab delimited file) Size of dump and data set: 1.1 MB Notes: provided to Science Commons by Addgene Openness: ? T&C here:...
-
Digital Object Identifiers (DOIs) are a persistent identifier scheme used by around 3,000 publishers to identify their documents, mostly scholarly publications. An example of a DOI is...
-
Contractors and suppliers of the Senate of the Italian Republic in 2010. More details are available at http://www.linkedopendata.it/datasets/los The ontology is documented at...
-
Data exposed: Extracted from 2007 Medline baseline distribution Size of dump and data set: 670 MB Notes: contact Medline for use terms
-
About Data exposed: List of all associations of MeSH headings to papers indexed by Medline extracted from 2007 Medline baseline distribution Size of dump and data set: 758 MB Notes:...
-
About The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by following this...
-
Data exposed: selected OBO ontologies, downloaded ~21 April 2007, augmented with inferred relations Size of dump and data set: 2.6 MB Notes: released without contract
-
Data exposed: All OBO ontologies Size of dump and data set: 36 MB Notes: Berkeley Bioinformatics Open-source Projects(BBOP); released without contract
-
About Data exposed: Select fields from Entrez Gene records Size of dump and data set: 7.7 MB Notes: NCBI Copyright and Disclaimers Openness Data appears to be in public domain....
-
Data exposed: NLM 2007 MeSH descriptor/qualifier pairs Size of dump and data set: 13 MB Openness: OPEN See http://www.nlm.nih.gov/mesh/termscon.html (basically attribution with...
-
Data exposed: A bridging ontology, from Science Commons, importing other ontologies used in the prototype, defining classes and relations used to represent gene records and their...
-
About The Allen Mouse Brain Atlas is an interactive, genome-wide image database of gene expression. Data exposed: Science Commons extract from ABA Web site, on or shortly before 26 Feb...
-
Dataset that was used for the Billion Triples Challenge 2010: See: http://challenge.semanticweb.org/ The major part of the dataset was crawled from the Web of Linked Data during...
-
Data exposed: SKOS representation of the RAMEAU book indexing vocabulary, maintained by the French National Library (BnF) Size of dump and data set: 130 MB uncompressed Notes:...
-
This is an RDF representation of the personal name authorities in the BIBSYS authority file. The dataset was created with funding from the Norwegian Archive, Library and Museum Authority...
-
This package is a collection and common access point to a group of Linked Data datasets related to Scotland. It was set up as an initiative of the Scottish Linked Data interest group and...
-
This dataset contains the main Index of Multiple Deprivation scores for 2010. It was created from data provided by the Department of Communities and Local Government. See here for...
-
This dataset provides information on the Lower Layer Super Output Areas for England and Wales, derived from information from the Office of National Statistics. The data is in the form of...
-
This package is a collection and common access point to a group of Linked Data datasets representing the English Indices of Multiple Deprivation data.
-
This dataset contains the main Index of Multiple Deprivation rankings for 2010. The regions (Lower Super Output Areas) are ordered according to their IMD score, with a rank of 1...
-
Data exposed: ontology focused on bibliography data of publications from DBLP with additions that include affiliations, universities, and publishers Size of dump and data set: 11M...
-
t4gm.info is a Linked Data rendering in RDFa and SKOS of the Library of Congress' Thesaurus for Graphic Materials. t4gm.info predates the Library of Congress exposure of TGM thesaurus...
-
About Now it is even easier to use the rich and diverse collection of real-world concepts in OpenCyc to bring meaning to your semantic web applications! The full OpenCyc content is now...
-
The Thesaurus for the Social Sciences (Thesaurus Sozialwissenschaften) contains about 11,600 entries, of which more than 7,750 are descriptors (authorised keywords) and about 3,850...
-
About Data exposed: Linguistic Data Size of dump and data set: ~40MB Openness Download dump: CC-BY-SA 3.0 license The web service additionally provides some parts that are not fully...
-
The thesaurus provides vocabulary on any economic subject: about 6,000 standardized subject headings and about 18,000 entry terms to support individual keywords. You can also find...
-
The MIUR is the Italian Ministry of Education, University and Research. Each year it publishes a set of useful information about university students. The LOIUS project...
-
A simple service that takes a Linked Data URI and gives back other URIs that may be the same Thing. Format of return can be in rdf+xml, rdf+n3, JSON or plain text. The data is a filtered...
-
Data exposed: 45 different domains, each with a separate data set. The data sets are focused on scientific research; these include DBLP, Citeseer, CORDIS, NSF, EPSRC, RAE2001, KISTI,...
-
The data presented here is a linked data representation of the street-level crime reports first released for England and Wales in 2011. Initial data exports cover December 2010, with...
-
Duplicate of package:2000-us-census-rdf
-
The Combined Nomenclature 2012 is a product scheme classification used to extract statistics. Authors: Jose María Alvarez Rodríguez & Jose Emilio Labra Gayo WESO-University of Oviedo
-
The Common Procurement Vocabulary (CPV) establishes a single classification system for public procurement aimed at standardising the references used by contracting authorities and...
-
This dataset, created by SADEI, contains information about the populated places of Asturias, including: - Codes to identify the type of a populated place: CC/PP/EE (C: code of...
-
The Statistical Classification of Products by Activity (CPA) is a product scheme classification used to extract statistics. It is an earlier classification that preceded the CPV. Authors: -Jose María...
-
The Central Product Classification (CPC) is a product scheme classification used to extract statistics. It is an earlier classification that preceded the CPV. Authors of the linked data version: -Jose...
-
The International Standard Industrial Classification of All Economic Activities (United Nations Statistics Division) is a product scheme classification used by the United Nations to create...
-
The Standard International Trade Classification V4 is used by the United Nations to create statistics. Authors of the linked data version: -Jose María Alvarez Rodríguez & Jose Emilio...
-
The North American Industry Classification System (NAICS) is the standard used by Federal statistical agencies in classifying business establishments for the purpose of collecting,...
-
The Common Procurement Vocabulary (CPV) establishes a single classification system for public procurement aimed at standardising the references used by contracting authorities and...
-
This dataset contains several product scheme classifications that have been transformed to linked data. This dataset is a “catalogue” dataset; the individual classifications are:...
-
The North American Industry Classification System (NAICS) is the standard used by Federal statistical agencies in classifying business establishments for the purpose of collecting,...
-
2000 U.S. Census converted into over a billion RDF triples. Population statistics at various geographic levels, from the U.S. as a whole, down through states, counties, sub-counties...
-
Data exposed: corporate ownership Size of dump and data set: 1.8 million triples Notes: also found in the list of SPARQL Endpoints
-
Data exposed: Traditional Chinese medicine, gene and disease association dataset and a linkset mapping TCM gene symbols to Entrez Gene IDs created by Neurocommons Size of dump and data...
-
About Data exposed: Yale Senselab Size of dump and data set: 216 KB Notes: released without contract The Semantic Web development of SenseLab involves exporting data from NeuronDB,...
-
Weather forecast data screen-scraped from pages like http://www.metoffice.gov.uk/weather/uk/os/kirkwall_forecast_weather.html and converted to Linked Data. New forecasts for every area in...
-
About Data exposed: Metadata (papers, presentations, people) for several semantic web related conferences and workshops, including the most recent ISWC, ESWC and WWW events. Notes: The...
-
List of postal codes in Italy. Includes street names, city and administrative regions.
-
List of accommodations in Piedmont, Italy. The dataset uses GoodRelations and vcard and includes addresses, contact information (where available) and geo-reference. Note: geo-reference is...
-
List of accommodations in Tuscany, Italy. The dataset uses GoodRelations and vcard and includes addresses, contact information (where available) and geo-reference. Note: geo-reference...
-
Linked Data about Brazilian politicians including personal data, election data, disclosure of assets, parliamentary data, leaderships, missions, mandates, clearances, speeches,...
-
About Data exposed: (used by output of MeSH to SKOS conversion) Size of dump and data set: 2.2 KB Notes: released without contract Openness Copyright notice: Integrated Public...
-
RDF conversion of Princeton's package:wordnet, version 3.0. With many links to package:w3c-wordnet, package:lexvo and the Dutch package:cornetto.
-
A lightweight, reference structure of 28,000 subject concepts for the Web. UMBEL is jointly developed and maintained by Structured Dynamics LLC and Ontotext AD. There is a total of...
-
This new IT resource, which the Dirección General de Libro, Archivos y Bibliotecas makes available to citizens, represents an important technological advance, since it involves the...
-
OPAC and Digital Library and the corresponding authority data as Linked Open Data. The vocabularies used are * RDFDC for bibliographic data, * FOAF for name authority entries, and *...
-
Dewey.info is an experimental space for linked DDC data. The intention of the dewey.info prototype is to be a platform for Dewey data on the Web. Included as linked data are the DDC...
-
Data exposed: Linked Clinical Trials Size of dump and data set: ~25 million triples as of April 2011; 4.8 GB N-Triples dump. License: CC BY-NC-SA. You are free to copy, distribute,...
-
Data exposed: Linked Data about Movies Size of data set: 6,148,121 triples. Openness: Open Mixture of material from Wikipedia, Freebase and Geonames and states on...
-
Data exposed: Information on Biological Orders, Families, Species as well as species occurrence records and related data The data set currently contains information and linked data for:...
-
Data exposed: various dumps Size of dump and data set: 1 billion triples
-
Dutch lexical database, similar to WordNet but with more semantic relations. Links to package:vu-wordnet and package:w3c-wordnet. When this dataset is used for research purposes,...
-
Description Data exposed: Information about airports, originally from package:ourairports, here re-published as RDF. Notes: Dump available by contact Issues The dataset does not appear...
-
This dataset aims at publishing the contents of Hungarian archives as Linked Open Data based on the National Digital Data Archive of Hungary. The dataset contains information about books,...
-
Data exposed: NTNames (for New Testament Names) is a semantic knowledge base describing each named thing in the New Testament Size of dump and data set: about 600 names NTNames base URI...
-
Data exposed: machine readable dictionary derived from WordNet 2.1, Wiktionary, the CMU Pronouncing Dictionary and the OpenCyc lexicon. Each lexicon word sense entry contains links back...
-
About Data exposed: a large life sciences data set Size of dump and data set: 3000M+ triples Openness Not open. Copyright page states: Copyright 2007-2012 UniProt Consortium. We...
-
Duplicate of package:twc-logd
-
About YAGO is a huge semantic knowledge base. Currently, YAGO knows more than 2 million entities (like persons, organizations, cities, etc.). It knows 20 million facts about these...
-
Description The package holds data from package:jamendo converted to RDF, available under the same license as the raw Jamendo data itself. The package also holds links towards Geonames...
-
Magnatune is an independent music label, allowing people to buy records for as much as they want. This package contains the Magnatune catalog in RDF format. The converted RDF data is...
-
RDF conversion of a dataset released by the BBC, about the John Peel sessions, a long-lived series of live music performances on BBC Radio 1, hosted by DJ John Peel.
-
List of geo-referenced Italian museums. Places are linked to Geonames. Museum categories are linked to DBpedia. More info at http://www.linkedopendata.it/datasets/musei
-
Contains addresses, type, contacts and other info about more than 50,000 public schools in Italy. The dataset is currently in alpha stage: its quality needs to be improved and schools are not...
-
Contractors and suppliers of the Chamber of Deputies in Italy in 2010. More details are available at http://www.linkedopendata.it/datasets/loc The ontology is documented at...