Repository: EpistasisLab/ClinicalDataSources
Branch: master
Commit: 72fe5e06128e
Files: 3
Total size: 13.8 KB
Directory structure:
gitextract_0oow092u/
├── CONTRIBUTING.md
├── LICENSE
└── README.md
================================================
FILE CONTENTS
================================================
================================================
FILE: CONTRIBUTING.md
================================================
# Contribution guidelines for the Clinical Data Sources repository
## Project Philosophy
this repository of open and easy to access clinical data sources is an open source community-based project. We want to make it easy for interested researchers to access clinical datasets as easily as possible.
## Issues
We use GitHub Issues for a wide range of tasks including bug reports, feature requests, planning, and general forum. If you have any comments or quesstions feel free to create a github issue.
## Pull Requests
We operate on a pull request model, all changes/additions should be submitted via [pull request](https://help.github.com/articles/using-pull-requests/ "GitHub · Using pull requests").
To keep the commit history clean, we may ask you to [rebase your pull request](https://github.com/edx/edx-platform/wiki/How-to-Rebase-a-Pull-Request "How to Rebase a Pull Request") or squash certain commits. If you don't know what this means, we'll help you through the process.
## Peer Review
All pull requests undergo peer review in which project participants review the changes. Reviewers may suggest modifications or even note their disapproval. Reviewers can note their approval by commenting along the lines of ":+1:", "looks good to me", or "approved". As a reviewer, it's helpful to note the type of review you performed: have you used the data, only accessed it, looked it over, or are you just supporting the concept?
Before a repository maintainer merges a pull request, there must be at least one affirmative review. If there is any unaddressed criticism or disapproval, a repository maintainer will determine how to proceed and may wait for additional feedback.
**Happy contributing!**
================================================
FILE: LICENSE
================================================
CC0 1.0 Universal
Statement of Purpose
The laws of most jurisdictions throughout the world automatically confer
exclusive Copyright and Related Rights (defined below) upon the creator and
subsequent owner(s) (each and all, an "owner") of an original work of
authorship and/or a database (each, a "Work").
Certain owners wish to permanently relinquish those rights to a Work for the
purpose of contributing to a commons of creative, cultural and scientific
works ("Commons") that the public can reliably and without fear of later
claims of infringement build upon, modify, incorporate in other works, reuse
and redistribute as freely as possible in any form whatsoever and for any
purposes, including without limitation commercial purposes. These owners may
contribute to the Commons to promote the ideal of a free culture and the
further production of creative, cultural and scientific works, or to gain
reputation or greater distribution for their Work in part through the use and
efforts of others.
For these and/or other purposes and motivations, and without any expectation
of additional consideration or compensation, the person associating CC0 with a
Work (the "Affirmer"), to the extent that he or she is an owner of Copyright
and Related Rights in the Work, voluntarily elects to apply CC0 to the Work
and publicly distribute the Work under its terms, with knowledge of his or her
Copyright and Related Rights in the Work and the meaning and intended legal
effect of CC0 on those rights.
1. Copyright and Related Rights. A Work made available under CC0 may be
protected by copyright and related or neighboring rights ("Copyright and
Related Rights"). Copyright and Related Rights include, but are not limited
to, the following:
i. the right to reproduce, adapt, distribute, perform, display, communicate,
and translate a Work;
ii. moral rights retained by the original author(s) and/or performer(s);
iii. publicity and privacy rights pertaining to a person's image or likeness
depicted in a Work;
iv. rights protecting against unfair competition in regards to a Work,
subject to the limitations in paragraph 4(a), below;
v. rights protecting the extraction, dissemination, use and reuse of data in
a Work;
vi. database rights (such as those arising under Directive 96/9/EC of the
European Parliament and of the Council of 11 March 1996 on the legal
protection of databases, and under any national implementation thereof,
including any amended or successor version of such directive); and
vii. other similar, equivalent or corresponding rights throughout the world
based on applicable law or treaty, and any national implementations thereof.
2. Waiver. To the greatest extent permitted by, but not in contravention of,
applicable law, Affirmer hereby overtly, fully, permanently, irrevocably and
unconditionally waives, abandons, and surrenders all of Affirmer's Copyright
and Related Rights and associated claims and causes of action, whether now
known or unknown (including existing as well as future claims and causes of
action), in the Work (i) in all territories worldwide, (ii) for the maximum
duration provided by applicable law or treaty (including future time
extensions), (iii) in any current or future medium and for any number of
copies, and (iv) for any purpose whatsoever, including without limitation
commercial, advertising or promotional purposes (the "Waiver"). Affirmer makes
the Waiver for the benefit of each member of the public at large and to the
detriment of Affirmer's heirs and successors, fully intending that such Waiver
shall not be subject to revocation, rescission, cancellation, termination, or
any other legal or equitable action to disrupt the quiet enjoyment of the Work
by the public as contemplated by Affirmer's express Statement of Purpose.
3. Public License Fallback. Should any part of the Waiver for any reason be
judged legally invalid or ineffective under applicable law, then the Waiver
shall be preserved to the maximum extent permitted taking into account
Affirmer's express Statement of Purpose. In addition, to the extent the Waiver
is so judged Affirmer hereby grants to each affected person a royalty-free,
non transferable, non sublicensable, non exclusive, irrevocable and
unconditional license to exercise Affirmer's Copyright and Related Rights in
the Work (i) in all territories worldwide, (ii) for the maximum duration
provided by applicable law or treaty (including future time extensions), (iii)
in any current or future medium and for any number of copies, and (iv) for any
purpose whatsoever, including without limitation commercial, advertising or
promotional purposes (the "License"). The License shall be deemed effective as
of the date CC0 was applied by Affirmer to the Work. Should any part of the
License for any reason be judged legally invalid or ineffective under
applicable law, such partial invalidity or ineffectiveness shall not
invalidate the remainder of the License, and in such case Affirmer hereby
affirms that he or she will not (i) exercise any of his or her remaining
Copyright and Related Rights in the Work or (ii) assert any associated claims
and causes of action with respect to the Work, in either case contrary to
Affirmer's express Statement of Purpose.
4. Limitations and Disclaimers.
a. No trademark or patent rights held by Affirmer are waived, abandoned,
surrendered, licensed or otherwise affected by this document.
b. Affirmer offers the Work as-is and makes no representations or warranties
of any kind concerning the Work, express, implied, statutory or otherwise,
including without limitation warranties of title, merchantability, fitness
for a particular purpose, non infringement, or the absence of latent or
other defects, accuracy, or the present or absence of errors, whether or not
discoverable, all to the greatest extent permissible under applicable law.
c. Affirmer disclaims responsibility for clearing rights of other persons
that may apply to the Work or any use thereof, including without limitation
any person's Copyright and Related Rights in the Work. Further, Affirmer
disclaims responsibility for obtaining any necessary consents, permissions
or other rights required for any use of the Work.
d. Affirmer understands and acknowledges that Creative Commons is not a
party to this document and has no duty or obligation with respect to this
CC0 or use of the Work.
For more information, please see
<http://creativecommons.org/publicdomain/zero/1.0/>
================================================
FILE: README.md
================================================
# Clinical Data Sources
We are assembling a repository of clinical data sources (Electronic Health Record, Clinical trials, Imaging etc.) that are either public or have low friction application processes. If you have any comments, corrections, or know of any additional sources, please add it as a pull request.
Types of data:
* Patient Demographics: P
* Lab tests: L
* Textual Patient Notes: T
* Imaging: I
* Other: O
| Name | Description | Data Types | Patient Count | Ease of Access | Publication/Citation Guidelines |
| ----------- |:-------------------:| ------------ | ------------- | --------------- | ----------- |
| [MIMIC](https://mimic.physionet.org/) | MIMIC is an openly available dataset developed by the MIT Lab for Computational Physiology, comprising deidentified health data associated with ~40,000 critical care patients. It includes demographics, vital signs, laboratory tests, medications, and more.| P, L, T | ~45,000 | Simple Application | [Link](http://dx.doi.org/10.1038/sdata.2016.35)|
| [Physionet 2012](https://physionet.org/challenge/2012/) | Tthe focus of the PhysioNet/CinC Challenge 2012 is to develop methods for patient-specific prediction of in-hospital mortality. Participants will use information collected during the first two days of an ICU stay to predict which patients survive their hospitalizations, and which patients do not. | P, L | 12,000 ICU stays | Public | [Link](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3965265/)|
| [i2b2](https://www.i2b2.org/NLP/DataSets/Main.php) | In order to enhance the ability of natural language processing (NLP) tools to prise increasingly fine grained information from clinical records, i2b2 has previously provided sets of fully deidentified notes from the Research Patient Data Repository at Partners HealthCare for a series of NLP Challenges organized by Dr. Ozlem Uzuner. We are pleased to now make those notes available to the community for general research purposes. At this time we are releasing the notes (~1,500) from the first four i2b2 Challenges as i2b2 NLP Research Data Sets. A similar set of notes from the most recent i2b2 Challenge will be released on the one year anniversary of that Challenge.|T | ~1,500|Simple Application (DUA) | [Link (depends on data usage)](https://www.i2b2.org/NLP/DataSets/Main.php)
| [PRO-ACT ALS](https://nctu.partners.org/ProACT/)| PRO-ACT provides users with easy access to: 1) Over 10,700 fully de-identified clinical patient records, 2) Placebo and treatment-arm data from 23 Phase II/III clinical trials, 3) Demographic, lab, medical and family history, and other data elements, 4) More than 10 million longitudinally collected data points | P, L | 10,700 | Simple Application | [Link](https://nctu.partners.org/ProACT/Document/DisplayLatest/6)
| [Cancer Imaging Archive](http://www.cancerimagingarchive.net/) | TCIA is a service which de-identifies and hosts a large archive of medical images of cancer accessible for public download. The data are organized as “Collections”, typically patients related by a common disease (e.g. lung cancer), image modality (MRI, CT, etc) or research focus. DICOM is the primary file format used by TCIA for image storage. Supporting data related to the images such as patient outcomes, treatment details, genomics, pathology, and expert analyses are also provided when available. | I, some studies have others | Most studies < 100 | Many Public | [Usage](https://wiki.cancerimagingarchive.net/display/Public/Data+Usage+Policies+and+Restrictions) [Publication](https://www.ncbi.nlm.nih.gov/pubmed/23884657)
| [National Biomedical Imaging Archive](https://imaging.nci.nih.gov/ncia/login.jsf) | Welcome to the National Biomedical Imaging Archive (NBIA). NBIA is a searchable repository of in vivo images that provides the biomedical research community, industry, and academia with access to image archives to be used in the development and validation of analytical software tools that support: Lesion detection and classification, Accelerated diagnostic imaging decision, Quantitative imaging assessment of drug response | I | Most studies < 100 | Many Public | NA
| [Open Access Series of Imaging Studies (OASIS)](http://www.oasis-brains.org/) | The Open Access Series of Imaging Studies (OASIS) is a project aimed at making MRI data sets of the brain freely available to the scientific community. By compiling and freely distributing MRI data sets, we hope to facilitate future discoveries in basic and clinical neuroscience. OASIS is made available by the Washington University Alzheimer’s Disease Research Center, Dr. Randy Buckner at the Howard Hughes Medical Institute (HHMI) at Harvard University, the Neuroinformatics Research Group (NRG) at Washington University School of Medicine, and the Biomedical Informatics Research Network (BIRN). | I | 416 (study 1), 150 (study 2) | Public | [Study 1](https://www.ncbi.nlm.nih.gov/pubmed/17714011) [Study 2](https://www.ncbi.nlm.nih.gov/pubmed/19929323)
<p xmlns:dct="http://purl.org/dc/terms/" xmlns:vcard="http://www.w3.org/2001/vcard-rdf/3.0#">
<a rel="license"
href="http://creativecommons.org/publicdomain/zero/1.0/">
<img src="http://i.creativecommons.org/p/zero/1.0/88x31.png" style="border-style: none;" alt="CC0" />
</a>
<br />
To the extent possible under law,
<a rel="dct:publisher"
href="https://github.com/EpistasisLab/ClinicalDataSources">
<span property="dct:title">Brett Beaulieu-Jones</span></a>
has waived all copyright and related or neighboring rights to
<span property="dct:title">Open or Easy Access Clinical Data Sources for Biomedical Research</span>.
This work is published from:
<span property="vcard:Country" datatype="dct:ISO3166"
content="US" about="https://github.com/EpistasisLab/ClinicalDataSources">
United States</span>.
</p>
gitextract_0oow092u/ ├── CONTRIBUTING.md ├── LICENSE └── README.md
Condensed preview — 3 files, each showing path, character count, and a content snippet. Download the .json file or copy for the full structured content (14K chars).
[
{
"path": "CONTRIBUTING.md",
"chars": 1713,
"preview": "# Contribution guidelines for the Clinical Data Sources repository\n\n## Project Philosophy\n\nthis repository of open and e"
},
{
"path": "LICENSE",
"chars": 6555,
"preview": "CC0 1.0 Universal\n\nStatement of Purpose\n\nThe laws of most jurisdictions throughout the world automatically confer\nexclus"
},
{
"path": "README.md",
"chars": 5851,
"preview": "# Clinical Data Sources\nWe are assembling a repository of clinical data sources (Electronic Health Record, Clinical tria"
}
]
About this extraction
This page contains the full source code of the EpistasisLab/ClinicalDataSources GitHub repository, extracted and formatted as plain text for AI agents and large language models (LLMs). The extraction includes 3 files (13.8 KB), approximately 3.3k tokens. Use this with OpenClaw, Claude, ChatGPT, Cursor, Windsurf, or any other AI tool that accepts text input. You can copy the full output to your clipboard or download it as a .txt file.
Extracted by GitExtract — free GitHub repo to text converter for AI. Built by Nikandr Surkov.