Datasets Included in Open PHACTS Version 2.0

| Dataset | Downloaded | Version | Licence | Triples |
|---|---|---|---|---|
| Bio Assay Ontology | | | CC-By | 10,360 |
| CALOHA | 8 Apr 2015 | 2014-01-22 | CC-By-ND | 14,552 |
| ChEBI | 4 Mar 2015 | 125 | CC-By-SA | 1,012,056 |
| ChEMBL | 18 Feb 2015 | 20.0 | CC-By-SA | 445,732,880 |
| ConceptWiki | 12 Dec 2013 | | CC-By-SA | 4,331,760 |
| DisGeNET | 31 Mar 2015 | 2.1.0 | ODbL | 15,011,136 |
| Disease Ontology | | 2015-05-21 | CC-By | 188,062 |
| DrugBank | 19 Feb 2015 | 4.1 | Non-commercial | 4,028,767 |
| ENZYME | | 2015_11 | CC-By-ND | 61,467 |
| FDA Adverse Events | 9 Jul 2012 | | CC0 | 13,557,070 |
| Gene Ontology | 4 Mar 2015 | | CC-By | 1,366,494 |
| Gene Ontology Annotations | 17 Feb 2015 | | CC-By | 879,448,347 |
| NCATS OPDDR | Nov 2015 | Oct 2015 | | 2,643 |
| neXtProt (NP) | 1 Feb 2014 | 1.0 | CC-By-ND | 215,006,108 |
| OPS Chemical Registry | | 4 Nov 2014 | CC-By-SA | 241,986,722 |
| HMDB | | 3.6 | HMDB | |
| MeSH | | 2015 | MeSH | |
| PDB Ligands | | 2 | PDB | |
| OPS Metadata | | | CC-By-SA | 2,053 |
| UniProt | | 2015_11 | CC-By-ND | 1,131,186,434 |
| WikiPathways | | 20151118 | CC-By | 11,781,627 |


Datasets Included in Open PHACTS Version 1.5


Version 1.5 includes data from one new dataset, the FDA Adverse Event Reporting System (FAERS), and additional fields from DrugBank. The drug-drug interactions from DrugBank have been added to the Compound Information calls.


The data currently in the Open PHACTS system (date downloaded and version numbers):


UniProt from 28 Jan 2015, release 2015_1

ENZYME from 02 Feb 2015, release 2015_1

DrugBank from 19 Feb 2015, version 4.1

ChEMBL from 18 Feb 2015, ChEMBL 20

ChEBI from 04 Mar 2015, Release 125

FDA Adverse Events (FAERS) data, 09 Jul 2012

Gene Ontology, 04 Mar 2015

Gene Ontology Annotations, 17 Feb 2015

WikiPathways, 20 Mar 2015, v20150312

DisGeNET, 31 Mar 2015, v2.1.0


Data sources not updated:

ConceptWiki

Open PHACTS Chemical Registration Service (OCRS)

neXtProt 


Additional Datasets Included in Open PHACTS Version 1.4


With the release of version 1.4.0, two new datasets were added to the system:


DisGeNET


neXtProt tissue expression



For version 1.3.0 there are slightly over 2.7 billion triples in the Open PHACTS data cache.


To verify the information in the table below, use the "DataSources" API method call, which returns the most up-to-date information. Our data is described using a standard called VoID (Vocabulary of Interlinked Datasets). To see the current versions of our datasets, run the "sources" method on the Open PHACTS API.
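As a rough illustration of calling the "sources" method, the Python sketch below builds the request URL and fetches the response. The host `api.example.org`, the `/sources` path, and the `app_id`, `app_key` and `_format` parameter names are assumptions made for this example and are not taken from this page; check the API documentation for the real values before using it.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# ASSUMPTION: placeholder host and path, not the real Open PHACTS endpoint.
BASE_URL = "https://api.example.org/openphacts"


def build_sources_url(app_id, app_key, base_url=BASE_URL, fmt="json"):
    """Build the URL for a hypothetical 'sources' call.

    The parameter names (app_id, app_key, _format) are assumptions.
    """
    query = urlencode({"app_id": app_id, "app_key": app_key, "_format": fmt})
    return f"{base_url}/sources?{query}"


def fetch_sources(app_id, app_key, base_url=BASE_URL):
    """Fetch and decode the JSON description of the loaded datasets."""
    with urlopen(build_sources_url(app_id, app_key, base_url)) as response:
        return json.load(response)
```

Calling `fetch_sources("my-id", "my-key")` would issue the request. Building the URL in a separate function makes it possible to inspect the request without touching the network.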


Please note that we distinguish between the databases we load and the identifiers we support. We support many more identifiers than just those listed below. For instance, you can query Open PHACTS with an Ensembl ID because we support that identifier, even though we have not loaded the Ensembl database into the platform. For more information on mappings, see http://openphacts.cs.man.ac.uk/.


Statistics of Datasets Loaded into Open PHACTS Version 1.3

| Source | Version | Supplier | Downloaded | Initial Records | Triples | Properties |
|---|---|---|---|---|---|---|
| Chembl | Chembl 16 RDF | EBI | 25 June 2013 | 1,481,473 (~1,295,510 compounds, 9,844 targets, 6,243 target components, 873 protein classes) | 304,360,719 | 77 |
| DrugBank | Aug 2008 | Bio2Rdf (www4.wiwiss.fu-berlin.de) | 08 Aug 2012 | 19,628 (~14,000 drugs, 5000 targets) | 517,584 | 74 |
| SwissProt, UniParc, UniRef | 2013_06 | SIB | 2013_06 | 564,246 | 405,473,138 | 82 |
| ENZYME | 2013_07 | SIB | 2013_07 | 6,187 | 73,459 | 2 |
| ChEBI | Release 104 | EBI | 19 June 2013 | 40,575 | 1,673,863 | 2 |
| GeneOntology | Jan 21, 2013 | GO | 21 Jan 2013 | 38,137 | 2,447,682 | 26 |
| GOA | 2013 | GO | 09 Sept 2013 | 661,232 | 1,765,622,393 | 15 |
| WikiPathways | v0.?1_20130710 | Maastricht | 10 July 2013 | 946 | 1,949,074 | 34 |
| ChemSpider, Open PHACTS Chemistry Registry (OCRS) | Nov 11, 2013 | | | 1,361,568 | 215,193,441 | 23 |
| ConceptWiki | version 1.3 | NBIC | 09 Sept 2013 | 2,828,966 | 4,291,131 | 1 |
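
The figure of slightly over 2.7 billion triples quoted for version 1.3.0 can be checked by summing the Triples column of the table above. The short Python sketch below does that arithmetic; the dictionary keys are shortened labels chosen for this example.

```python
# Triples per source, copied from the Version 1.3 statistics table.
TRIPLES_V1_3 = {
    "Chembl": 304_360_719,
    "DrugBank": 517_584,
    "SwissProt, UniParc, UniRef": 405_473_138,
    "ENZYME": 73_459,
    "ChEBI": 1_673_863,
    "GeneOntology": 2_447_682,
    "GOA": 1_765_622_393,
    "WikiPathways": 1_949_074,
    "ChemSpider / OCRS": 215_193_441,
    "ConceptWiki": 4_291_131,
}

total = sum(TRIPLES_V1_3.values())
print(f"{total:,} triples ({total / 1e9:.2f} billion)")
# prints "2,701,602,484 triples (2.70 billion)"
```

GOA alone contributes about 1.77 billion of these triples, so the total is dominated by the Gene Ontology annotations.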