Challenges in obtaining medical data

- 5 mins

Machine learning models rely on the training data provided. The algorithms learn from the examples available. It generally requires a lot of training data to develop models that are reliable and generalisable. The following are some of the challenges associated with getting workable data for medical projects :

As per the American Hospital Association survey from 2009[3] :

On the basis of responses from 63.1% of hospitals surveyed, only 1.5% of U.S. hospitals have a comprehensive electronic-records system (i.e., present in all clin- ical units), and an additional 7.6% have a basic system (i.e., present in at least one clinical unit). Computerized provider-order entry for medications has been implemented in only 17% of hospitals. Larger hospitals, those located in urban areas, and teaching hospitals were more likely to have electronic-records systems. Respondents cited capital requirements and high maintenance costs as the primary barriers to implementation…..

The above numbers have significantly improved in the US after the HITECH Act, but there are still issues as cited by Colligan et. al. [4] which leave hospitals dissatisfied with the EHR adoption and uses. It is also to be noted that while USA has been largely successful in increasing EHR usage among hospitals, so is not the case for most other countries. Many hospitals around the world still do not use EHR extensively and a lot of work in hospitals still happens on paper with little or no digitisation, with costs of setting up a digital system and amount of time and money needed to be spent in training the employees being a major barrier.

References :

[1] Deng et. al ImageNet: A Large-Scale Hierarchical Image Database. CVPR, 2009

[2] http://image-net.org/about-stats

[3] Jha et.al Use of Electronic Health Records in U.S. Hospitals, The New England Journal of medicine, 2009

[4] Colligan et. al. American Medical Association Sources of physician satisfaction and dissatisfaction and review of administrative tasks in ambulatory practice: A qualitative analysis of physician and staff interviews.

[5] Linda T. Kohn, Janet M. Corrigan, and Molla S. Donaldson To Err Is Human: Building a Safer Health System (1999).Committee on Quality of Health Care in America, Institute of Medicine, National Academy Press, Washington, D.C.

[6] HIMSS Electronic Health Record Committee EHR Definition, Attributes and Essential Requirements Version 1.0

[7] https://www.legislation.gov.au/Details/C2012A00063

[8] International Classification of Diseases,Ninth Revision (ICD-9) https://www.cdc.gov/nchs/icd/icd9.htm

[9] ICD-10 versions on the WHO website https://www.who.int/classifications/icd/icdonlineversions/en/

[10] ICD-10 on CDC website https://www.cdc.gov/nchs/icd/icd10cm.htm