There are 152 datasets tagged with lodcloud.nolinks:
-
Data exposed: GO annotations from National Center for Biotechnology Information (NCBI) and European Bioinformatics Institute (EBI) Size of dump and data set: 73 MB Openness Derived...
-
Data exposed: FlyAtlas and Affy D2 probe-to-gene Size of dump and data set: size? Notes: also found in the list of SPARQL Endpoints
-
About Data exposed: provides daily generated dumps with all its DOAP project descriptions Size of dump and data set: size? Notes: 2009-05-24: Both files seem to be empty - hg...
-
Data exposed: what? Size of dump and data set: 626 KB Notes: NCBI Copyright and Disclaimers
-
Data exposed: DMOZ Size of dump and data set: size? Openness: OPEN (?) Uses the Open Directory License, which is, in essence, open (there may be some wrinkles about updates).
-
Data exposed: collaborative file describing service Size of dump and data set: 330,026 discrete files, 270MB uncompressed
-
Data exposed: — Size of dump and data set: size? Notes: this is the classic RDF source but historically has had some problems with RDF correctness.
-
Data exposed: at least 42,000 famous quotations with author and subject Size of dump and data set: size?
-
Status Note: the data does not appear to have been updated since March 2006. About From website: Wikipedia³ is a conversion of the English Wikipedia into RDF. It's a monthly updated...
-
Data exposed: derived from data published by www.fly-ted.org and provides metadata on images depicting in situ hybridisation in D. melanogaster testes. Size of dump and data set: size?...
-
About Data exposed: NLM 2007 MeSH Size of dump and data set: 13 MB Notes: MeSH MOU Openness Appears to be in public domain. Copyright page states: Government information at NLM...
-
About Data exposed: various data sets including CIA's World Factbook, Library of Congress' Thesaurus of Graphic Materials, National Cancer Institute's cancer thesaurus, Web Consortium's...
-
Data exposed: 290344 restaurants - 104856 reviews - 59243 links to reviews - 2402 editors Size of dump and data set: size? Openness: OPEN Available under Open Directory License.
-
About Data exposed: BAMS Size of dump and data set: 5.6 MB Notes: 2009-05-24: File does not exist - hg / Health Care and Life Sciences Interest Group (HCLSIG) / National Institute...
-
Data exposed: Entrez Gene Extract from [ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene_info.gz] Size of dump and data set: 5.6 MB Notes: NCBI Copyright and Disclaimers
-
Data exposed: Galen from co-ode.org Size of dump and data set: 1.9 MB Notes: released without contract Openness: ? No license specified on home page though generic (restrictive)...
-
Data exposed: metadata extracted from Wikipedia Size of dump and data set: 47 million triples
-
Data exposed: data exposed? Size of dump and data set: expands to about 1.3GB
-
National raw and geodata catalog of the United States. See also package:twc-logd for an RDF version that features dereferenceable URIs and is more interlinked.
-
Dataset from University of Southampton Open Data Service. A catalog of known websites, web widgets, phone apps and other tools using our datasets.
-
The Prelinger Archives is a collection of films relating to U.S. cultural history, the evolution of the American landscape, everyday life and social history. It was physically located in...
-
The NAL Thesaurus is an online vocabulary tool of agricultural terms in English and Spanish and is cooperatively produced by the National Agricultural Library, USDA and the Inter-American...
-
This dataset contains metadata for about 400,000 nationally important places as recorded by English Heritage, the UK Government's statutory adviser on the historic environment. This...
-
This is an RDF snapshot of the Federal Reserve Bank of St. Louis's FRED2 database, which contains a variety of economic data.
-
UN Numbers are categories of hazardous materials. See http://en.wikipedia.org/wiki/UN_numbers for more details.
-
A lexicon from Pali to English.
-
A lexicon from Sanskrit to English.
-
RDF version of the ChemPedia substances data.
-
This has been screen scraped from the Glastonbury website.
-
The Linked Periodicals Database is a data set from the Data Incubator which aggregates journal metadata provided by CrossRef, Highwire Press and the National Library of Medicine....
-
PROD is a directory of JISC funded projects. Where possible it contains lists of the software and interoperability standards that these projects used, as well as links to other projects...
-
Data exposed: Addgene catalog (tab delimited file) Size of dump and data set: 1.1 MB Notes: provided to Science Commons by Addgene Openness: ? T&C here:...
-
Contractors and suppliers of the Senate of the Italian Republic in 2010. More details are available at http://www.linkedopendata.it/datasets/los The ontology is documented at...
-
About From website: The London Gazette, Official Newspaper of Record for the United Kingdom plays a major role in the information infrastructure for government, with 175,000 notices...
-
Data exposed: Extracted from 2007 Medline baseline distribution Size of dump and data set: 670 MB Notes: contact Medline for use terms
-
About Data exposed: List of all associations of MeSH headings to papers indexed by Medline extracted from 2007 Medline baseline distribution Size of dump and data set: 758 MB Notes:...
-
About The complete dataset is composed of a set of smaller datasets. Each download is in one of two formats: (1) WARC or (2) tar.gz. You can read about the WARC format by following this...
-
Data exposed: selected OBO ontologies, downloaded ~21 April 2007, augmented with inferred relations Size of dump and data set: 2.6 MB Notes: released without contract
-
Data exposed: All OBO ontologies Size of dump and data set: 36 MB Notes: Berkeley Bioinformatics Open-source Projects (BBOP); released without contract
-
About Data exposed: Select fields from Entrez Gene records Size of dump and data set: 7.7 MB Notes: NCBI Copyright and Disclaimers Openness Data appears to be in public domain....
-
Data exposed: NLM 2007 MeSH descriptor/qualifier pairs Size of dump and data set: 13 MB Openness: OPEN See http://www.nlm.nih.gov/mesh/termscon.html (basically attribution with...
-
Data exposed: A bridging ontology, from Science Commons, importing other ontologies used in the prototype, defining classes and relations used to represent gene records and their...
-
About The Allen Mouse Brain Atlas is an interactive, genome-wide image database of gene expression. Data exposed: Science Commons extract from ABA Web site, on or shortly before 26 Feb...
-
Dataset used for the Billion Triples Challenge 2010. See: http://challenge.semanticweb.org/ The major part of the dataset was crawled from the Web of Linked Data during...
-
This service exposes the data from openthesaurus.de as Linked Data.
-
Datos Abiertos Zaragoza is an initiative of the Zaragoza City Council to promote the reuse of the information published on its website by citizens, businesses and...
-
The GeoNames Ontology makes it possible to add geospatial semantic information to the World Wide Web. All of the more than 6.2 million GeoNames toponyms now have a unique URL with a corresponding RDF...
-
The LOV dataset contains the description of RDFS vocabularies or OWL ontologies defined for and used by datasets in the Linked Data Cloud. Whenever available each vocabulary includes...
-
This is a linked data representation of the various geographical regions used by Scottish Neighbourhood Statistics, such as Census Output Areas, Data Zones, Intermediate Geographies, and...
-
This dataset provides information on the Lower Layer Super Output Areas for England and Wales, derived from information from the Office for National Statistics. The data is in the form of...
-
This package is a collection and common access point to a group of Linked Data datasets representing the English Indices of Multiple Deprivation data.
-
This is a linked data version of bus timetable data for Greater Manchester in the UK. It is based on open data in ATCO-CIF format, made available through Greater Manchester's open data...
-
Access problems: as of 2010-09-30, the dataset is completely inaccessible (no DNS entry).
-
InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to new protein sequences in order to...
-
The Entrez Nucleotide database is a collection of sequences from several sources, including GenBank, RefSeq, and PDB. The number of bases in these databases continues to grow at an...
-
RDFized version of the CAS database (or parts of it) provided by the bio2rdf project. The usage policies of the original source are found at http://www.cas.org/legal/infopolicy.html...
-
Semantic web atlas of postgenomic knowledge
-
Access problems: as of 2010-09-30, the dataset is completely inaccessible (no DNS entry).
-
Data exposed: ontology focused on bibliography data of publications from DBLP with additions that include affiliations, universities, and publishers Size of dump and data set: 11M...
-
SOCH is a set of 3.4 million (as of December 2010) cultural heritage objects harvested from a large number of museums and other local, regional and national cultural heritage...
-
Metadata about public domain works available at Project Gutenberg. Data last updated in 2007.
-
A partial RDF conversion of the CIA World Factbook, package:cia-world-factbook.
-
We are still in development mode, working in particular on the namespace URIs and their hosting. So some elements here will change as we tweak responsibility for maintaining...
-
LDEO log files as data (TEST leg 218 only). We are still in development mode, working in particular on the namespace URIs and their hosting. So some elements here will change as we...
-
About Now it is even easier to use the rich and diverse collection of real-world concepts in OpenCyc to bring meaning to your semantic web applications! The full OpenCyc content is now...
-
Farming statistics (farm sizes, land use, livestock) at local authority level, represented in RDF. This is a conversion of the June 2008 DEFRA survey on land use and livestock. The source...
-
Janus LOD is a test of exposing Janus data using a linked data application. http://data.oceandrilling.org/januslod/ Data in Janus comes from the Deep Sea Drilling Program and Ocean...
-
The FAO geopolitical ontology provides a master reference for geopolitical information, as it manages names in multiple languages (English, French, Spanish, Arabic, Chinese, Russian and...
-
RDF for artists, records, performances etc., generated from package:musicbrainz.
-
Overview of worldwide data catalogues.
-
The present data set contains all the versions of the NUTS statistical regions in linked data format. For the UK the NUTS3 level is further aligned to the local administrative units (or...
-
The Animal Diversity Web is an extraordinarily rich, multimedia natural history database with a high profile among educators and general audiences. We currently have 2,150 accounts of...
-
The RDF representation of TCGA was achieved by representing data elements from the TCGA dataset as statements from the S3DB Core Model (see S3DB Core Model for more information on the...
-
LOC (Linked Open Commerce) is a collaboration of OpenLink Software, Hepp Research GmbH and Linktegration that delivers a structured Linked Data space on the Web for finding products and...
-
This repository contains data from the UK's EPSRC Grants on the Web Data, but only up to 2003. Thus it contains data about many scientific researchers in the UK, and the projects they...
-
This domain is different from most of the other rkbexplorer domains. Its sole purpose is to publish the coreference data from OpenCyc provided by David Baxter, from Cycorp Inc.....
-
This is a store that contains the email archive of the University of Southampton Digital Economy mailing list. It mostly uses the SIOC ontology. The mbox2rdf translator was provided...
-
This site lists the ontology development work carried out during Stage 2 of the ResearchSpace Project by Seme4 Ltd. We present the CIDOC-CRM (RDFS, v5.0.2) with a modified namespace such...
-
A simple service that takes a Linked Data URI and gives back other URIs that may be the same Thing. Format of return can be in rdf+xml, rdf+n3, JSON or plain text. The data is a filtered...
-
Data exposed: 45 different domains, each with a separate data set. The data sets are focused on scientific research; these include DBLP, Citeseer, CORDIS, NSF, EPSRC, RAE2001, KISTI,...
-
Currently this site contains only a restricted set of information about popular fiction.
-
The IPTC not only provides news exchange formats to the news industry but also creates and maintains sets of concepts to be assigned as metadata values to news objects like text,...
-
The bibliographic data from Acta Cryst E, a publication by the International Union of Crystallography (IUCr), has been extracted and made available with their consent. The data dump...
-
About One web page for every book ever published. It's a lofty, but achievable, goal. To build it, we need hundreds of millions of book records, a brand new database infrastructure for...
-
2000 U.S. Census converted into over a billion RDF triples. Population statistics at various geographic levels, from the U.S. as a whole, down through states, counties, sub-counties...
-
D2R Server publishing the DBLP Bibliography Database, hosted at L3S Research Center
-
The data is the catalogue records of the Mass Observation Archive, a Designated collection. The Mass Observation Archive is a written record of everyday life in Britain 1937-55. It's a...
-
About Data exposed: Yale Senselab Size of dump and data set: 216 KB Notes: released without contract The Semantic Web development of SenseLab involves exporting data from NeuronDB,...
-
A network of Linked Data that analyzes the tweets and user profiles of Twitter users who are registered in the Twitter archive Grabeeter. The tweets are annotated using popular...
-
Bricklink is an unofficial Lego marketplace. Essentially it is the eBay for Lego, where you can buy or sell anything to do with Lego. The Lego community maintain a number of fantastic...
-
Foodista is a community-edited recipe wiki, published under a Creative Commons Attribution license. The wiki contains information on foods, tools, techniques, and recipes. This data has...
-
Publishing recipes as Linked Data will: help people find recipes by making them more discoverable; allow recipes to be annotated and rated by the community...
-
List of postal codes in Italy. Includes street names, city and administrative regions.
-
About Data exposed: (used by output of MeSH to SKOS conversion) Size of dump and data set: 2.2 KB Notes: released without contract Openness Copyright notice: Integrated Public...
-
RDF-ized controlled vocabulary of Norwegian terms with Universal Decimal Classification numbers; library metadata from the Norwegian University of Science and Technology.
-
NGII is an open data service system based on an ontology in which spatial information and human geography information are integrated. Collecting and analyzing human geography information...
-
The General Finnish Thesaurus (YSA) contains general, commonly used terms. The thesaurus is widely used in Finnish libraries and other organisations.
-
Newsweek uses RDFa and FOAF in its articles to describe authors and relationships. As the RSS feed is currently not working, the data is provided only in HTML, in no other format. No...
-
lobid-organisations provides URIs for library organisations. The URIs are based on the existing and well established International Standard Identifier for Libraries and Related...
-
Dewey.info is an experimental space for linked DDC data. The intention of the dewey.info prototype is to be a platform for Dewey data on the Web. Included as linked data are the DDC...
-
The Spanish National Library (Biblioteca Nacional de España, BNE) and the Ontology Engineering Group of Universidad Politécnica de Madrid are working on the joint project “Preliminary...
-
Offener Haushalt shows the complex data of several German budgets (federal, Munich, Berlin). It also gives access to the data in an open and re-usable format.
-
Bund Offener Haushalt shows the complex data of Berlin's budget. It also gives access to the data in an open and re-usable format. Part of package:offener-haushalt.
-
Bund Offener Haushalt shows the complex data of the German federal budget. It also gives access to the data in an open and re-usable format. Part of package:offener-haushalt.
-
DBpedia.org is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated...
-
See also package:ordnance_survey. Published data: geographical data about England, Wales, and Scotland. Provides identifiers for counties, cities, wards, census areas...
-
MeSH is the National Library of Medicine's controlled vocabulary thesaurus. It consists of sets of terms naming descriptors in a hierarchical structure that permits searching at various...
-
An API for Slideshare.net that provides RDF metadata for the presentations uploaded to Slideshare. The RDF representation uses the SIOC ontology.
-
Data exposed: various dumps Size of dump and data set: 1 billion triples
-
This dataset contains almost all Dutch national regulations in the CEN MetaLex XML, RDF Linked Data and Pajek Network formats. Current coverage is in the order of 27k documents,...
-
Note: The number of triples is a rough guess based on the 2600 RDF documents found in Sindice and an assumption of ~20 triples per page. http://rdf.ecs.soton.ac.uk/ontology/ecs#
-
A list of most Senior Civil Service posts in the Scottish Government including title, contact details, their line manager, and where disclosed, the name of the officer. Vacant posts are...
-
Data exposed: (for New Testament Names) is a semantic knowledge base describing each named thing in the New Testament Size of dump and data set: about 600 names NTNames base URI...
-
The Facebook Linked Data Service (LIDS) is a Facebook wrapper which implements (some of) the public Facebook Graph API methods and makes them available for use with Linked Data.
-
Feedwrapper returns SIOC data for newsfeeds in RSS 1.0/2.0 and Atom feeds. Feedwrapper is based on the ROME RSS API.
-
The Twitter Linked Data Service (LIDS) is a wrapper for Twitter which implements (most of) the public Twitter API methods and makes them available for use with Linked Data. The wrapper...
-
Semantic Universe has begun producing linked data for its Enterprise Data World and Semantic Technology Conferences. With these as starting points, it is easy to start to navigate the...
-
LinkedMarkMail is a simple Linked Data interface for accessing the MarkMail archives.
-
From the web page about the INSEMTIVES project: INSEMTIVES is about bridging the gap between human and computational intelligence and providing incentives for users to contribute to the...
-
Data exposed: machine readable dictionary derived from WordNet 2.1, Wiktionary, the CMU Pronouncing Dictionary and the OpenCyc lexicon. Each lexicon word sense entry contains links back...
-
Linked data for every time interval and instant into the past and future, from years down to seconds. This is an infinite set of linked data. It includes government years and properly...
-
About Data exposed: a large life sciences data set Size of dump and data set: 3000M+ triples Openness Not open. Copyright page states: Copyright 2007-2012 UniProt Consortium. We...
-
The European Environment Agency (EEA) is an agency of the European Union. Our task is to provide sound, independent information on the environment. We are a major information source for...
-
The Transport dataset is based on NaPTAN and traffic flow data. These datasets are compiled by the Department for Transport. Details on Transport data can be found at...
-
The Geographic data is provided by Ordnance Survey, Great Britain's national mapping agency. This dataset contains the up-to-date geographic data, relied on by government, business and...
-
Using the Education Data: the education dataset is based on Edubase data. This dataset is now compiled by the newly formed Education Department. Details on Edubase can be found at...
-
The data includes details of items of legislation, their versions and related documents. The legislation data is bibliographic, and the linked data representation makes use of...
-
DCLG’s data portfolio includes a whole raft of statistics on key socio-economic issues. This includes topics such as housing and planning, levels of deprivation in local areas, local...
-
COINS – the Combined On-line Information System – is used by the Treasury to collect financial data from across the public sector to support fiscal management, the production of...
-
Bund Offener Haushalt shows the complex data of Munich's budget. It also gives access to the data in an open and re-usable format. Part of package:offener-haushalt.
-
A list of most Senior Civil Service posts in the Thurrock Thames Gateway Development Corporation including title, contact details, their line manager, and where disclosed, the name of the...
-
The Near dataset provides "near" links connecting points of interest that are geographically close to each other. The data set currently cross-links items in DBpedia, Geonames and Edubase.
-
RxNorm provides normalized names for clinical drugs and links its names to many of the drug vocabularies commonly used in pharmacy management and drug interaction software. Notes on...
-
The Santillana Guide dataset represents the content of the Santillana guide (owned by Prisa Digital) as Linked Data. The guide contains information about more than 1500 Spanish...
-
Presents a standard conversion of Princeton WordNet to RDF/OWL. It describes how it was converted and gives examples of how it may be queried for use in Semantic Web applications....
-
Maritime piracy event descriptions from the International Chamber of Commerce International Maritime Bureau. Accessing the dataset: SPARQL query form. No dereferenceable...
-
The complete (99% of all points of interest), quality-controlled (60% updated within the last 4 weeks) GoodRelations-based description of shopping and trade in a major German city, with a...
-
This dataset contains the scores for the Living Environment Deprivation Domain of the Index of Multiple Deprivation, 2010. This indicator measures the quality of individuals’ immediate...
-
This dataset puts the 32,482 LSOAs into a rank order based on their 2007 IMD score. A rank of 1 is the most deprived.
-
About: Linked ISO 3166-2 Data. ISO 3166-2 gives codes for countries and their principal subdivisions. Openness: Published under CC0. (Where is this specified?)
-
This dataset describes the 'Lower layer Super Output Areas' used by the Office for National Statistics for many of its statistical outputs. Example resource:...
-
The estimated total population (male and female, all ages) for each Lower layer Super Output Area for mid-year 2005.
-
Contains addresses, types, contacts and other information about more than 50,000 public schools in Italy. The dataset is currently in alpha stage: its quality needs to be improved and schools are not...
-
Contractors and suppliers of the Chamber of Deputies in Italy in 2010. More details are available at http://www.linkedopendata.it/datasets/loc The ontology is documented at...
-
Open Data Communities offers Linked Data access to the Index of Multiple Deprivation datasets from the Department of Communities and Local Government. This provides a range of useful...