Published on in Vol 8, No 6 (2021): June

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/26681, first published .
Design and Implementation of an Informatics Infrastructure for Standardized Data Acquisition, Transfer, Storage, and Export in Psychiatric Clinical Routine: Feasibility Study

Design and Implementation of an Informatics Infrastructure for Standardized Data Acquisition, Transfer, Storage, and Export in Psychiatric Clinical Routine: Feasibility Study

Design and Implementation of an Informatics Infrastructure for Standardized Data Acquisition, Transfer, Storage, and Export in Psychiatric Clinical Routine: Feasibility Study

Original Paper

1Institute of Medical Informatics, University of Münster, Münster, Germany

2Department of Psychiatry, University of Münster, Münster, Germany

3The Florey Institute of Neuroscience and Mental Health, University of Melbourne, Melbourne, Australia

4Department of Psychiatry, Melbourne Medical School, University of Melbourne, Melbourne, Australia

5Institute of Medical Informatics, Heidelberg University Hospital, Heidelberg, Germany

6Institute for Translational Psychiatry, University of Münster, Münster, Germany

7Interdisciplinary Centre for Clinical Research of the Medical Faculty, University of Münster, Münster, Germany

*these authors contributed equally

Corresponding Author:

Nils Opel, MD

Department of Psychiatry

University of Münster

Albert-Schweitzer-Str. 11

Münster, 48149

Germany

Phone: 49 2518358160

Email: n_opel01@uni-muenster.de


Background: Empirically driven personalized diagnostic applications and treatment stratification is widely perceived as a major hallmark in psychiatry. However, databased personalized decision making requires standardized data acquisition and data access, which are currently absent in psychiatric clinical routine.

Objective: Here, we describe the informatics infrastructure implemented at the psychiatric Münster University Hospital, which allows standardized acquisition, transfer, storage, and export of clinical data for future real-time predictive modelling in psychiatric routine.

Methods: We designed and implemented a technical architecture that includes an extension of the electronic health record (EHR) via scalable standardized data collection and data transfer between EHRs and research databases, thus allowing the pooling of EHRs and research data in a unified database and technical solutions for the visual presentation of collected data and analyses results in the EHR. The Single-source Metadata ARchitecture Transformation (SMA:T) was used as the software architecture. SMA:T is an extension of the EHR system and uses module-driven engineering to generate standardized applications and interfaces. The operational data model was used as the standard. Standardized data were entered on iPads via the Mobile Patient Survey (MoPat) and the web application Mopat@home, and the standardized transmission, processing, display, and export of data were realized via SMA:T.

Results: The technical feasibility of the informatics infrastructure was demonstrated in the course of this study. We created 19 standardized documentation forms with 241 items. For 317 patients, 6451 instances were automatically transferred to the EHR system without errors. Moreover, 96,323 instances were automatically transferred from the EHR system to the research database for further analyses.

Conclusions: In this study, we present the successful implementation of the informatics infrastructure enabling standardized data acquisition and data access for future real-time predictive modelling in clinical routine in psychiatry. The technical solution presented here might guide similar initiatives at other sites and thus help to pave the way toward future application of predictive models in psychiatric clinical routine.

JMIR Ment Health 2021;8(6):e26681

doi:10.2196/26681

Keywords



Scientific Background

Psychiatric disorders represent one of the leading causes of disability worldwide. In the challenge to provide advanced treatment and prevention strategies for psychiatric disorders, previous research has focused on better understanding of the neurobiological basis of affective disorders [1]. However, the translation of such findings into clinical application remains an unresolved problem up to now. For this reason, the focus of psychiatric research has shifted from sole neurobiological characterization at the group level toward the application of multivariate machine learning methods trained on multimodal data for individualized prediction of clinical outcomes [2,3]. Multivariate machine learning applications have been proven to be innovative and powerful tools in translational psychiatric research. In this regard, the successful utilization of machine learning algorithms for individualized predictions of treatment response [4-6], depression severity [7], disease risk [8], differential diagnosis [9,10], and relapse risk [11] has yielded the first promising results. However, up to now, several obstacles have prevented the successful transfer of individual predictive modeling to clinical routine application, as discussed in recent reviews [12-15]. In this regard, the gap between homogeneous well-characterized samples acquired in experimental studies [16] and heterogeneous unvalidated data from day-to-day clinical routine has proven to be a major obstacle in the translation of predictive models to clinical application. Hence, ecologically valid predictive models would require access to standardized real world data collected at the point of care [17].

Importantly, large-scale studies reporting the successful application of multivariate models trained on data from electronic health records (EHRs), including features such as diagnosis and procedures, laboratory parameters, and medications for the prediction of suicide risk or weight gain following antidepressant treatment have demonstrated the capacity and generalizability of predictive models trained on real-world data [18-20]. Further extension of EHRs via standardized collection of predictive variables such as known risk factors might further enhance the potential of this novel data entity for predictive analytics in psychiatry [21,22]. Standardized electronic collection of patient-reported outcomes that has previously been shown to improve clinical outcomes such as survival in patients with cancer represents another possibility to enrich EHR data. Similarly, combining data from EHRs with research data might provide new opportunities for the discovery and validation of psychiatric endophenotypes as demonstrated via recent validation of a polygenic risk score in a Danish population study [23]. However, future application of predictive models for personalized diagnostic and treatment requires their validation via clinical trials that, in turn, critically depend on the availability of the informatics infrastructure for the application of predictive models in routine care. The required informatics infrastructure should facilitate the acquisition of standardized real world data at the point of care, potential enrichment with patient-reported outcomes or research data, and subsequent access to data for clinicians and researchers. However, while these technical requirements are already available in selected clinical settings, for example, in the United States [24], they are up to now absent in the clinical working environment of psychiatry hospitals in many European countries. More concretely, ORBIS, the EHR system that is the market leader in Germany, Austria, and Switzerland, does not currently support standardized form metadata, clinical data, or annotated data sets. Our approach thus addresses the currently unmet need to (1) implement the technical requirements for standardized data acquisition and analysis in one of the most widely used EHR systems in Europe and (2) to specifically design a technical solution, including appropriate data collection routines, for the domain of clinical psychiatry.

This study aims to present the design and implementation of the technical requirements to address the aforementioned challenges with the ultimate goal of providing the basis for a successful future translation of predictive models to clinical application in psychiatric disorders. The implementation of the outlined technical solution will ultimately allow the evaluation of the potential of predictive models for the clinical management of psychiatric disorders under real-world conditions. In detail, we present the design and implementation of the informatics infrastructure, including technical solutions for (1) extension of the EHR via standardized electronic collection of patient-reported outcomes, (2) data transfer between EHRs and research databases, (3) pooling of EHRs and research data in a unified database, and (4) visual presentation of the analyses results in the EHRs.

Objective of This Study

The main objective of this study was the design and successful implementation of the informatics infrastructure required to train and validate predictive models in day-to-day clinical application in psychiatry as part of the SEED 11/19 study [25]. Our study consisted of the following steps in detail:

  1. Implementation of standardized documentation forms in EHRs.
  2. The set-up of an interface for direct data transfer between clinical documentation systems and a database for predictive analysis.
  3. The set-up of a unified database that allows pooling of clinical data with further research data for predictive analysis.
  4. Visual presentation of relevant data entities and results of predictive analysis in EHRs at the point of care.

Setting

The Münster University Hospital in Germany is a tertiary care hospital with 1457 beds and 11,197 staff who treated 607,414 patients (inbound and outbound) in 2019 [26]. The department for psychiatry and psychotherapy at the University Hospital treated 1341 cases in the study period from February 25, 2019 to July 31, 2020 (1042 cases in 2018 [27,28]). Validation was carried out by 25 doctors and 61 specialists from the health care sector.

System Details

The EHR system ORBIS by Dedalus Healthcare is used at Münster University Hospital in more than 40 clinics and is the market leader in Germany, Austria, and Switzerland with over 1300 installations [29]. The EHR system has an 8700 GB Oracle database, 7938 users, and 1927 user sessions per day (status at July 2020) at Münster University Hospital. No standardized metadata form, clinical data, and annotated data sets are supported.

Requirement Engineering

To address the study aims, the following requirements were identified through focus groups including physicians and researchers at Münster University Hospital in Germany.

  1. Extension of the EHR via standardized data collection: At first sight, the widely established usage of electronic documentation systems in clinical routine might supplement the notion of a fast translation of predictive models. However, until now, the majority of clinical data is still acquired and stored in an unstructured way that cannot be directly used for predictive modeling. Extension of EHR data via standardized forms of data collection in routine care is therefore required to provide a sufficient database for the development of predictive models. Importantly, the technical solution should be flexible and allow to update the content of the collected EHR data. Content-wise, in an initial step, standardized extension of EHR data should include assessment of symptomatology in order to allow both patient stratification at baseline as well as outcome measurement following intervention. Furthermore, standardized assessment of known risk factors, including life events and sociodemographic data, appears meaningful.
  2. Data transfer: Routine EHR data storage systems are usually strictly separated from research databases for safety reasons and hence are not directly accessible for predictive analyses. Training and validation of predictive models based on EHR data requires the set-up of interfaces and a database in which EHR data can be transferred and subsequently stored in a standardized way. In line with our study aim, the technical solutions should be scalable and allow data transfer in real time. EHR data transferred and stored in the database must be accessible for researchers in order to allow the development of predictive models.
  3. Combination of EHRs and research data: Again, since routine EHR data storage systems are strictly separated from research databases, pooling of EHR and research data is not possible within state-of-the art EHR databases. Pooling EHR with research data in a unified database would allow the enrichment of predictive models trained on EHR data by adding already existing research data and furthermore to validate EHR data based on research data. To this end, in order to combine each patient’s EHR and research data, a unified scalable research database is needed that allows the integration of EHRs and research data acquired via experimental studies.
  4. Presentation of standardized data within the EHR: Once collected, clinically useful standardized data as well as results of any analysis must be transferred back to the main EHR system in real time and presented to the clinician at the point of care.

Solution Requirements

An informatics infrastructure enabling real-time clinical predictive modeling based on the single-source architecture was derived from the named requirements. Custom metadata must be supported. The Clinical Data Interchange Standards Consortium (CDISC) Operational Data Model (ODM) (version 1.3.2) was used as a flexible standard for exchange and archiving of metadata within the framework of clinical studies [30,31]. Mobile apps must be able to communicate with the architecture. Automatic data transfer into the database of the EHR system and from the EHR system to a research database was carried out via a communication server. ODM files were transported automatically to the database of the EHR system with Health Level 7 (HL7) messages [32]. NextGen Connect [33] was used as a communication server. HL7 version 2.5 and message type ORU^R01 were used. The plausibility and completeness of form data were validated by the clinical users.

Analysis of Technical and Clinical Feasibility

The technical feasibility was demonstrated by the implementation of an infrastructure that enables clinical predictive modeling in real time. Java version 1.8.0_181 [34], JavaScript ECMAScript 6 [35], TypeScript version 3.7.2 [36], and the proprietary language of the EHR system were used as programming languages. MongoDB Java Drivers version 3.9.1 [37] and Json-lib version 2.4 [38] were used as third-party libraries. MongoDB version 4.2.3 [39] was used as a research database, Docker version 19.03.13 [40] for operating system–level virtualization, and Red Hat Enterprise version 7.8 [41] as research server. The clinical feasibility was determined by piloting the architecture in the clinic for psychiatry and psychotherapy and for a prospective analysis of the clinical documentation forms used. The clinical users of the system were 25 doctors and 61 health care sector specialists. The stakeholder of the study at Münster University Hospital is the Institute for Translational Psychiatry, Department of Psychiatry. Evaluation began on February 25, 2019 and ended on July 31, 2020. EHR data from daily clinical routine (eg, laboratory data, diagnostic codes) and self-reports/patient-reported outcomes that were experimentally collected as an extension of the clinical routine documentation as part of the SEED research project were examined. The following evaluation criteria were analyzed: (1) measurement of data completeness in the created documentation forms, (2) measurement of data completeness in the research database, (3) monitoring of system stability, and (4) monitoring of data transfer. SPSS Statistics version 25 (IBM Corp) [42] was used for descriptive data analysis. Adobe Photoshop version 11.0 [43] and Microsoft Visio version 16.0.4849.1000 [44] were used to depict the workflow.


System Architecture

The Single-source Metadata ARchitecture Transformation (SMA:T) was used as the software architecture [45]. SMA:T is an extension of the EHR system of the Münster University Hospital and uses module-driven software development [46] to generate standardized applications and interfaces. Every SMA:T form has a generic built-in interface for exchanging standardized data. Embedded applications [45] were used as the application type. These are linked to an ODM file in the EHR database, from which a documentation form is generated. All metadata and clinical data are available in the ODM developed by CDISC version 1.3.2. Patient-reported outcomes are recorded via Mobile Patient Survey (MoPat) [47,48] on mobile devices (generation 6 iPads) and via the web application Mopat@home (a modified version of the tablet-based web app MoPat) [49] for follow-up assessments following discharge from inpatient treatment. Collected data are transferred to the communication server via an HL7 message and from there to the database of the EHR system. Data are sent in the OBX-5 segment of the HL7 message. SMA:T provides database storage. A reference to the imported clinical data is saved. Clinical data are automatically inserted by SMA:T when the documentation form is opened for the first time. A unique ID from the HL7 header is used for this purpose. Each ID is linked to an imported clinical record. The structure of the architecture is shown in Figure 1. Data transfer to the research database takes place via the researcher module from SMA:T. This provides a front end to the EHR system and an extension of the communication server for data transfers. Both prospective and retrospective standardized data exports of EHR data points are supported, specifically, vital signs, laboratory data, medication data, and administrative data. Each data export can be customized by individual parameters. The following parameters are supported: name of data export, export interval, database query, destination parameters for electronic data capture systems, or research databases. MongoDB and RedCap [50] are currently provided as destination templates in the EHR system. The destination portfolio can easily be expanded with interface functions of SMA:T. The research database is embedded in a Docker container of a virtualized Red Hat Enterprise Linux server. The data flow from EHR to electronic data capture is shown in Figure 2. The software architecture is shown in Figure 3.

Figure 1. Unified Modeling Language sequence diagram of the data collection workflow. In process steps 1-3, the patient completes the forms and sends data to the communication server. In process steps 4-8, the communication server sends data to the electronic health record system and creates a blank documentation form. This form is populated with imported data. In process steps 9-13, SMA:T creates the documentation form with metadata and imported data. EHR: electronic health record; HL7: Health Level 7; MoPat: Mobile Patient Survey; ODM: operational data model; SMA:T: Single-source Metadata ARchitecture Transformation.
View this figure
Figure 2. Unified Modeling Language sequence diagram of the data extraction workflow. In process steps 1-8, a study query is created with SMA:T and a generic operational data model file is saved in the database of the electronic health record system. In process steps 9-18, a generic Mirth Channel is created based on the study query. In process steps 19-20, data points are automatically extracted from the electronic health record system and transferred to the study database using operational data model standard format. EDC: electronic data capture; EHR: electronic health record; HDD: Hard Disc Drive; HL7: Health Level 7; LOC: Lines Of Code; MoPat: Mobile Patient Survey; ODM: operational data model; SMA:T: Single-source Metadata ARchitecture Transformation.
View this figure
Figure 3. SEED software architecture of the Münster University Hospital. EHR: electronic health record; MoPat: Mobile Patient Survey; SMA:T: Single-source Metadata ARchitecture Transformation; *supports custom applications.
View this figure

System Implementation

The implementation of the architecture is divided into 4 areas: data collection, data transfer, data storage, and data visualization. Agile methods were used for Project Life Cycle and Development Cycle [51].

Data Collection

SMA:T provides 2 options for data collection, namely, the EHR system in clinical routine and dedicated web applications. Data input via web applications can be designed freely. In this study, EHR data generated as part of clinical routine documentation comprised, among others, laboratory data, medication, information on diagnosis, time of admission, and length of stay and are presented in detail in Table 1. MoPat [47,48] was selected for the collection of patient-reported outcomes. After input of the patient case ID, staff handed the patient an iPad with the MoPat app. Patients were then guided through a series of documentation forms comprising different questionnaires and they entered data on the mobile device (Figure 4). The iPad was then returned to the medical staff. Further details regarding the collection of patient-reported outcomes during inpatient treatment have previously been described [25]. In brief, the self-reports applied in this study are based on well-established questionnaires and scales in the domain of psychiatry and clinical psychology. In addition, to the retention of single item information, sum scales were calculated based on the recommendations provided in the original manuals and references [52-58]. In addition, Mopat@home was used for the collection of patient-reported outcomes following discharge. To this end, patients were sent an email, which provided a link to a website in which the above referenced questionnaires were presented and could be filled out [49].

Table 1. Research documentation used in the Department of Psychiatry.
Name of the documentation formItems
SEED ClinicalData Admission Date & Time4
SEED ClinicalData Classification2
SEED ClinicalData Diagnosis-Related Groups/Diagnosis3
SEED ClinicalData Electroconvulsive Therapy11
SEED ClinicalData Laboratory Assessments7
SEED ClinicalData Medication5
SEED ClinicalData Patient4
SEED ClinicalData Vital Signs3
Figure 4. One item of Beck Depression Inventory presented in the MoPat app (clinic for psychiatry and psychotherapy at Münster University Hospital).
View this figure
Data Transfer

SMA:T provides 2 types of data transfer in the present scenario, that is, data transfer into the EHR system and transfer into the electronic data capture system. MoPat sends data to the EHR system via the communication server of the University Hospital. Data are saved in the ClinicalData structure of the ODM format. The ODM document is embedded in an HL7 message. Each HL7 message creates a form in the EHR system. The header of the HL7 message determines which form is automatically created. Data transfer to the electronic data capture takes place via SMA:T interfaces. Both retrospective and prospective data exports in real time are supported. When a study query was activated via the EHR frontend, metadata and corresponding structured query language statements were read by the SMA:T extension of the communication server. SMA:T uses its code library and channel framework to generate unique Mirth channels. These send a database query to the EHR system and transfers the output directly to the electronic data capture system. Both metadata (clinical documentation form) and clinical patient data are provided by SMA:T in the ODM format. Data records are combined into an ODM document. In this study, SMA:T converts the resulting XML-based ODM document into JavaScript Object Notation format [59] (JODM format [60]). The JavaScript Object Notation schema [61] of JODM [60] is open source and currently limited to Study and ClinicalData nodes, including all subnodes of the ODM in version 1.3.2.

Data Storage

Data storage addresses metadata and clinical data. Metadata of clinical documentation forms are stored centrally in the SMA:T database. The SMA:T database model is part of the EHR database model. Metadata and clinical data are available in the ODM format. MoPat also supports ODM format; therefore, the same data model can be used for both systems. Clinical data are clearly identified by unique object identifiers and the associated object identifier on the documentation form.

Data Presentation

Usability principles were applied to visualize data [62-64]. A one-column layout was implemented according to the requirements of the 10 web form design guidelines [65]. Those forms are displayed via SMA:T within the EHR system (see Multimedia Appendix 1 and Multimedia Appendix 2). SMA:T supports item-based real-time notifications as well as centralized notification services to display analysis results in real time.

Technical and Clinical Feasibility

As part of the study, 11 standardized documentation forms with 202 items were created for the clinic for psychiatry and psychotherapy (Table 2): Beck Depression Inventory [66], Big Five Inventory (BFI-2-S) [67], Big Five Inventory (BFI-2-XS) [68], Childhood Trauma Questionnaire [69], Family Mental History [70], Hamilton Depression Scale [71], Narcissistic Admiration and Rivalry Questionnaire [72], Symptom Checklist-90 Somatization Scale [73], sociodemographic questionnaire [74], questions on individual disease course [75], and questions on somatic comorbidities [76]. Data models without license restrictions are available in the portal of medical data models. A documentation form is a document from the EHR system (see Multimedia Appendix 1) and consists of several items. An item consists of an input field and the associated label. For example, 1 item from Multimedia Appendix 1 is the drop-down box labeled A; 5866 instances were created by the patients and automatically transferred to the EHR system of the Münster University Hospital without errors. An instance is a form created by a user; 412 cases from 317 patients were processed by 86 users (Table 3). A case is defined as an inpatient stay or an outpatient visit to a hospital or clinic. Of the 123 of the medical staff of the clinic for psychiatry and psychotherapy, 86 (69.9%) worked with those documentation forms. The data quality could be improved by the ODM. Metadata was a critical step in building a generic and automated workflow. All items are now provided with a unique object identifier, have a typing of the data types, and a code list for converting text into numeric values. Automatic generation of documentation forms was accepted in routine clinical use. Standardized data transfer from the communication server into the EHR system was completed without error. It was possible to display all items (n=202) from ODM structures in full by using the generic workflow. Clinical data from 317 patients was stored in the EHR database; 96.7% (4360/4509) of the scores could be calculated and transferred into the EHR system (Beck’s Depression Inventory [52,53], Big Five Inventory [54], Childhood Trauma Questionnaire [55], Hamilton Depression Scale [56], Narcissistic Admiration and Rivalry Questionnaire [57], and Symptom Checklist-90 Somatization Scale [58]) (Table 4), and 111,842 items were completed by patients on mobile devices (Table 5). Approximately 99.3% (91,329/91,974) of forms with scores (91,974 items/645 uncompleted items) were completed and 88.1% (12,617/14,318) of forms without scores (14,318 items/1701 uncompleted items) were completed. The validity of the acquired data on depressive symptomatology was already analyzed in a feasibility study [25]. Eight standardized documentation forms with 39 items were created for the retrospective data export (Table 1): SEED ClinicalData Admission Date & Time [77], SEED ClinicalData Classification [78], SEED ClinicalData Diagnosis-Related group/Diagnosis [79], SEED ClinicalData electroconvulsive therapy [80], SEED ClinicalData Laboratory Assessments [81], SEED ClinicalData Medication [82], SEED ClinicalData Patient [83], and SEED ClinicalData Vital Signs [84]. A total of 96,323 instances of vital signs, laboratory data, medication data, and administrative data could be automatically transferred from the EHR system to the research database (Table 6). Retrospective ODM-based data export worked correctly without technical errors, and 585 instances were created by the patients with Mopat@home and transferred to the research database via SMA:T (Table 7).

Table 2. Routine documentation used in the Department of Psychiatry.
Name of the documentation form (n=11)Items (n=202)
Beck Depression Inventory23
Big Five Inventory (BFI-2-S)35
Big Five Inventory (BFI-2-XS)20
Childhood Trauma Questionnaire34
Family Mental History14
Hamilton Depression Scale25
Narcissistic Admiration and Rivalry Questionnaire9
Symptom Checklist-90 Somatization Scale14
Sociodemographic questionnaire5
Questions on individual disease course18
Questions on somatic comorbidities5
Table 3. Number of instances created for each documentation form: the counts of patients, patient cases, and users are shown.
Name of the documentation formCasesPatientsInstancesUsers
Beck Depression Inventory380307126650
Big Five Inventory (BFI-2-S)35830355925
Big Five Inventory (BFI-2-XS)25821769219
Childhood Trauma Questionnaire31330335433
Family Mental History31530534331
Hamilton Depression Scale35029651642
Narcissistic Admiration and Rivalry Questionnaire35730255820
Symptom Checklist-90 Somatization Scale36030356418
Sociodemographic questionnaire31530534426
Questions on individual disease course31530534226
Questions on somatic comorbidities31330332810
Table 4. Data quality of patient-based documentation regarding score calculation.
Name of the documentation formInstancesScoresMissing dataa
Beck Depression Inventory1266123828
Big Five Inventory (BFI-2-S)55954019
Big Five Inventory (BFI-2-XS)69265636
Childhood Trauma Questionnaire35432034
Hamilton Depression Scale51650214
Narcissistic Admiration and Rivalry Questionnaire5585508
Symptom Checklist-90 Somatization Scale56455410

aMissing data frequency is determined by missing data entries.

Table 5. Data on the completeness of the documentation forms.a
Name of the documentation formItemsCompleted itemsUncompleted items
Beck Depression Inventory29,11829,015103
Big Five Inventory (BFI-2-S)19,56519,51946
Big Five Inventory (BFI-2-XS)13,84013,739101
Childhood Trauma Questionnaire12,03611,98551
Family Mental History48024354448
Hamilton Depression Scale12,38412,076308
Narcissistic Admiration and Rivalry Questionnaire5031501219
Symptom Checklist-90 Somatization Scale7896787917
Sociodemographic questionnaire172017155
Questions on individual disease course61565453703
Questions on somatic comorbidities16401095545

aIn this context, completeness means that the documentation form contains values in all data points.

Table 6. Number of retrospectively transferred research documentation forms (electronic health record to electronic data capture).a
Name of the documentation formInstances in electronic health records
SEED ClinicalData Admission Date & Time245
SEED ClinicalData Classification8260
SEED ClinicalData Diagnosis-Related Groups/Diagnosis1163
SEED ClinicalData Electroconvulsive therapy452
SEED ClinicalData Laboratory Assessments22,886
SEED ClinicalData Medication14,244
SEED ClinicalData Patient245
SEED ClinicalData Vital Signs48,828

aElectronic health record data were extracted with generic study queries in the Single-source Metadata ARchitecture Transformation system.

Table 7. Number of instances created with Mopat@home for each documentation form.
Name of the documentation formInstances
Beck Depression Inventory65
Big Five Inventory (BFI-2-S)64
Childhood Trauma Questionnaire65
Family Mental History65
Narcissistic Admiration and Rivalry Questionnaire64
Symptom Checklist-90 Somatization Scale66
Sociodemographic questionnaire66
Questions on individual disease course65
Questions on somatic comorbidities65

Answers to the Study Questions

The aim of this study was the design and implementation of an informatics infrastructure enabling standardized data acquisition at the point of care and subsequent accessibility of clinical data for analytic purposes, which is required for future application of predictive models in day-to-day clinical routine in psychiatry. In this study, we have shown the overall technical feasibility of the implemented solution. Standardized documentation forms were implemented to extend EHR data domains and to improve data quality in the EHR system. An automated transfer of data into the EHR system and the research database was implemented, thus enabling the pooling of EHR data with already existing research data from ongoing cohort studies. This system was accepted by clinical staff from the Department of Psychiatry of Münster University Hospital in Germany. Widespread use of documentation forms could be demonstrated. Standardized electronic data collection in the EHR at the point of care was successfully implemented. The latter solution can similarly be applied for the presentation of results from predictive models.

Strengths and Weaknesses of This Study

The major strengths of this study are standardized acquisition, transfer, storage, and export of data in real time with a generic informatics infrastructure. This system fulfills the prerequisites for future predictive modelling in clinical routine in psychiatry [85-87]. Standardized data transfer in ODM format provides scalability in the context of complex medical data structures. The Define-XML standard, an extension of the ODM standard, is mandated by regulatory authorities such as Food and Drug Administration for metadata [88]. Compliance with regulatory standards is the major advantage of our infrastructure regarding future clinical studies. The data format had to be converted due to the research database, which is a limitation. MongoDB was chosen for rapid analysis of large amounts of data in previous work [89]. Standardized automatic data transfer into research databases was possible for both retrospective and prospective research questions. The data of the EHR system was responsible for the number of documentation forms for the retrospective export. Data export can be configured centrally from the EHR system in compliance with local data protection regulations. Our approach is scalable because ORBIS EHR systems are used in more than 1300 hospitals in Germany, Austria, and Switzerland. The evaluation concentrated on technical and clinical feasibility. Limitations include the lack of elaborated standardized evaluations of the user experience of the system by clinical staff. Moreover, further evaluation is necessary in order to assess the sustainable benefit in everyday clinical practice. Although the feasibility and acceptability of the implemented data input interface has been demonstrated in a recent publication [24] and the wide-spread use of the implemented data presentation format in the EHR indicates acceptability, it appears important to note that no further feedback from clinicians (ie, in the form of structured interviews or questionnaires) has been acquired, which limits the informative value regarding user satisfaction. This important issue should therefore be addressed by future works based on elaborated user feedback. Of note, the projected acquisition of data from several hundred cases per year based on our set-up results in a database of modest scale was comparable to that by successfully established deep learning models in other fields of medicine [90]. Yet, it appears important to take into account that that the current state-of-the-art machine learning approaches in psychiatric research are based on cohorts with smaller sample sizes that were acquired over a period of multiple years [91-93]. The present initiative that aims to train predictive models on data from clinical routine documentation thus offers a perspective to significantly increase sample sizes in machine learning research in psychiatry. The training of predictive models as well as their validation in clinical applications is not within the scope of the this study but will be the focus of subsequent work building on the technical infrastructure outlined in this study. Importantly, as our standardized data acquisition protocol covers established risk factors and symptom profiles that have in part already been successfully used for predictive analytics in psychiatric cohort studies [91,93], it appears reasonable to assume their predictive validity for the intended prediction of symptom trajectories and functional outcomes in future work.

Results in Relation to Other Studies

Through our study, we extend a previous line of research on predictive modeling based on EHR data. While previous studies have demonstrated empirical evidence for the predictive validity of EHR data in psychiatric use cases [18-20], to the best of our knowledge, our study is the first to not only report on the design but also on the successful implementation and technical feasibility of the informatics infrastructure for standardized acquisition, transfer, storage, and access of real world data for analytic purposes in psychiatric care, which is the basic requirement for the application and validation of predictive models in future clinical studies. Although we are not aware of any other study that has reported successful implementation of a comparable informatics infrastructure in psychiatric clinical routine, several preliminary reports should be taken into account. Complementary to the work presented in this study, Khalilia et al [94] described a Fast Healthcare Interoperability Resources (FHIR) web modeling service that was tested on a pilot intensive care unit dataset. A multi-source approach was used. No binding standard is used for clinical studies; instead, the standard Observational Medical Outcomes Partnership Common Data Model was applied [95] and an FHIR server and database are required for this system, which might limit potential implementations at multiple sites, considering that many EHR systems currently do not yet use an FHIR server. Of note, we are aware of several large-scale efforts aiming to translate predictive models into psychiatric practice [96] that, once implemented, might serve as a future base for comparison of system stability and performance. Importantly, the presented infrastructure represents a flexible solution that allows compatibility with existing initiatives and concepts of data standardization such as the Common Data Elements repository of the National Institutes of Health [97]. The choice of the ODM as the data standard implies the automatic provision of a metadata provider for each item. Thus, data points can be enriched with additional codes based on standards such as the Systematized Nomenclature of Medicine Clinical Terms or the Unified Medical Language System [98,99]. The integration takes place via the alias node or the SMA:T schema extension of the ODM. This makes it possible to enrich the survey data with additional metadata. International standardizations are hence compatible with the operating data model based on a 1-1 mapping of item definition nodes.

Generalizability of This Study

The informatics infrastructure for standardized data acquisition, transfer, storage, and export in real time for future predictive modelling outlined in this study is an important step in the complex process toward the implementation of machine learning and clinical decision support solutions in routine care. Our study shows that this approach is technically feasible. Owing to the standardization, this concept is also scalable for other medical areas. Data warehouse applications of a heterogeneous hospital landscape can be implemented with this software architecture. In addition to local artificial intelligence applications, multi-site implementations of the architecture could also transfer pseudonymized data points into a global predictive model. The implementation of national and international predictive models in medicine would be possible.

Future Work

Artificial intelligence systems rely on high-quality data. In the future, artificial intelligence applications might send real-time evaluations directly back into the EHR system. Clinical staff could access and respond to calculated predictions. Selected data will be provided in a modular dashboard. Medical device regulation needs to be taken into account for implementation of such systems. Direct data transfer back from the clinic would be possible. Real-time adjustments of the prediction models would thus be possible. Standardization of clinical routine documentation via SMA:T can provide high-quality structured data points. It is planned to augment this database with further research data from existing cohort studies, for example, covering neuroimaging and genetic data. Specific prediction models can be trained in this way with the same architecture. Generic model pipelines can be set up. Model clusters can be set up to answer complex medical questions. Basically, SMA:T forms a solid technical infrastructure for the implementation of artificial intelligence solutions in medicine. Scheme extensions of the ODM standard can be implemented to optimize communication between systems. Observational and interventional studies are warranted to evaluate the predictive validity of machine learning models in psychiatric routine. For multi-center studies, SMA:T needs to be reimplemented in the respective EHR environments to process CDISC ODM files. A software blueprint is available [45]. If SMA:T and MoPat are already in use, the architecture can be set up within a short time frame of approximately 1 week. The generic concept of the architecture enables the reuse of our data models, database queries, and server architecture. Retrospective database queries might have to be reimplemented in the EHR environments. The necessary data can be used from our repository on GitHub [100]. Another important consideration is the potential future enrichment of EHR data with mobile assessments, including ecological momentary assessments and passive sensor data derived from smartphones. Recent reports on successful real-time prediction of depressive symptoms based on ecological momentary assessment data supplement this notion [101]. Thus, future studies should explore technical solutions that allow data transfer between EHRs and patients’ smartphones. Future work will evaluate the predictive potential of the acquired data entities by training and validating machine learning models for an individual level prediction of treatment response, functional outcome, and depression relapse. In accordance with findings from previous machine learning approaches in psychiatric cohort studies, in a first step, well-established predictive algorithms such as support vector machines will be trained on features covering risk and symptom profiles, sociodemographic variables, medication, and treatment history [7,91,93]. Yet importantly, as opposed to previous cohort studies, the technical infrastructure outlined in this study will allow to train and validate predictive models in naturalistic patient samples in routine care.

Conclusions

The presented informatics infrastructure enabling standardized data acquisition, transfer, storage, and export in real time for future predictive modelling in clinical routine in psychiatry is technically feasible. The outlined architecture provides a technical basis for the application, first and foremost, and the validation of clinical decision support systems and artificial intelligence applications in clinical studies.

Acknowledgments

We are deeply indebted to all the participants of this study. This study was supported by a grant from BMBF (HiGHmed 01ZZ1802V). Funding was provided by the Interdisciplinary Center for Clinical Research (IZKF) of the medical faculty of Münster (Grant SEED 11/19 to NO) as well as the “Innovative Medizinische Forschung” (IMF) of the medical faculty of Münster (Grants OP121710 to NO).

Authors' Contributions

RB and NO drafted the manuscript. RB developed the software architecture and conducted the statistical analyses. NO was responsible for the formulation of the overarching research goal and aims, the study design, conception, development of the methodology, management and coordination responsibility for the research activity planning and execution, data acquisition and design, creation of data models, and data analysis. NO and MD acquired financial support for the conduction of this study. MS supported programming and the implementation of computer code. All authors contributed to the interpretation of data. All authors critically reviewed and substantially revised the final manuscript.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Beck Depression Inventory documentation form created with Single-source Metadata ARchitecture Transformation.

PNG File , 37 KB

Multimedia Appendix 2

Symptom Checklist-90 Somatization Scale documentation form created with Single-source Metadata ARchitecture Transformation.

PNG File , 119 KB

  1. Schmaal L, Hibar D, Sämann P, et al. Cortical abnormalities in adults and adolescents with major depression based on brain scans from 20 cohorts worldwide in the ENIGMA Major Depressive Disorder Working Group. Molecular Psychiatry volume 22 2017:900-909 [FREE Full text] [CrossRef]
  2. Hahn T, Marquand AF, Ehlis A, Dresler T, Kittel-Schneider S, Jarczok TA, et al. Integrating neurobiological markers of depression. Arch Gen Psychiatry 2011 Apr;68(4):361-368. [CrossRef] [Medline]
  3. Huys QJM, Maia TV, Frank MJ. Computational psychiatry as a bridge from neuroscience to clinical applications. Nat Neurosci 2016 Mar;19(3):404-413 [FREE Full text] [CrossRef] [Medline]
  4. Chekroud AM, Zotti RJ, Shehzad Z, Gueorguieva R, Johnson MK, Trivedi MH, et al. Cross-trial prediction of treatment outcome in depression: a machine learning approach. Lancet Psychiatry 2016 Mar;3(3):243-250. [CrossRef] [Medline]
  5. Koutsouleris N, Meisenzahl EM, Borgwardt S, Riecher-Rössler A, Frodl T, Kambeitz J, et al. Individualized differential diagnosis of schizophrenia and mood disorders using neuroanatomical biomarkers. Brain 2015 Jul;138(Pt 7):2059-2073 [FREE Full text] [CrossRef] [Medline]
  6. Redlich R, Opel N, Grotegerd D, Dohm K, Zaremba D, Bürger C, et al. Prediction of Individual Response to Electroconvulsive Therapy via Machine Learning on Structural Magnetic Resonance Imaging Data. JAMA Psychiatry 2016 Jun 01;73(6):557-564. [CrossRef] [Medline]
  7. Kessler RC, van Loo HM, Wardenaar KJ, Bossarte RM, Brenner LA, Cai T, et al. Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Mol Psychiatry 2016 Oct;21(10):1366-1371 [FREE Full text] [CrossRef] [Medline]
  8. Opel N, Zwanzger P, Redlich R, Grotegerd D, Dohm K, Arolt V, et al. Differing brain structural correlates of familial and environmental risk for major depressive disorder revealed by a combined VBM/pattern recognition approach. Psychol. Med 2015 Sep 10;46(2):277-290. [CrossRef]
  9. Koutsouleris N, Meisenzahl EM, Borgwardt S, Riecher-Rössler A, Frodl T, Kambeitz J, et al. Individualized differential diagnosis of schizophrenia and mood disorders using neuroanatomical biomarkers. Brain 2015 Jul;138(Pt 7):2059-2073 [FREE Full text] [CrossRef] [Medline]
  10. Redlich R, Almeida JJR, Grotegerd D, Opel N, Kugel H, Heindel W, et al. Brain morphometric biomarkers distinguishing unipolar and bipolar depression. A voxel-based morphometry-pattern classification approach. JAMA Psychiatry 2014 Nov;71(11):1222-1230 [FREE Full text] [CrossRef] [Medline]
  11. Cearns M, Opel N, Clark S, Kaehler C, Thalamuthu A, Heindel W, et al. Predicting rehospitalization within 2 years of initial patient admission for a major depressive episode: a multimodal machine learning approach. Transl Psychiatry 2019 Nov 11;9(1):285 [FREE Full text] [CrossRef] [Medline]
  12. Hirschtritt ME, Insel TR. Digital Technologies in Psychiatry: Present and Future. Focus (Am Psychiatr Publ) 2018 Jul;16(3):251-258 [FREE Full text] [CrossRef] [Medline]
  13. Hsin H, Fromer M, Peterson B, Walter C, Fleck M, Campbell A, et al. Transforming Psychiatry into Data-Driven Medicine with Digital Measurement Tools. NPJ Digit Med 2018;1:37 [FREE Full text] [CrossRef] [Medline]
  14. Rutledge RB, Chekroud AM, Huys QJ. Machine learning and big data in psychiatry: toward clinical applications. Curr Opin Neurobiol 2019 Apr;55:152-159. [CrossRef] [Medline]
  15. Torous J, Baker JT. Why Psychiatry Needs Data Science and Data Science Needs Psychiatry: Connecting With Technology. JAMA Psychiatry 2016 Jan;73(1):3-4. [CrossRef] [Medline]
  16. Zimmerman M, Clark HL, Multach MD, Walsh E, Rosenstein LK, Gazarian D. Have Treatment Studies of Depression Become Even Less Generalizable? A Review of the Inclusion and Exclusion Criteria Used in Placebo-Controlled Antidepressant Efficacy Trials Published During the Past 20 Years. Mayo Clin Proc 2015 Sep;90(9):1180-1186. [CrossRef] [Medline]
  17. Skyttberg N, Vicente J, Chen R, Blomqvist H, Koch S. How to improve vital sign data quality for use in clinical decision support systems? A qualitative study in nine Swedish emergency departments. BMC Med Inform Decis Mak 2016 Jun 04;16:61 [FREE Full text] [CrossRef] [Medline]
  18. Barak-Corren Y, Castro VM, Javitt S, Hoffnagle AG, Dai Y, Perlis RH, et al. Predicting Suicidal Behavior From Longitudinal Electronic Health Records. Am J Psychiatry 2017 Feb 01;174(2):154-162. [CrossRef] [Medline]
  19. Barak-Corren Y, Castro VM, Nock MK, Mandl KD, Madsen EM, Seiger A, et al. Validation of an Electronic Health Record-Based Suicide Risk Prediction Modeling Approach Across Multiple Health Care Systems. JAMA Netw Open 2020 Mar 02;3(3):e201262 [FREE Full text] [CrossRef] [Medline]
  20. Blumenthal SR, Castro VM, Clements CC, Rosenfield HR, Murphy SN, Fava M, et al. An electronic health records study of long-term weight gain following antidepressant use. JAMA Psychiatry 2014 Aug;71(8):889-896. [CrossRef] [Medline]
  21. Simon GE. Big Data From Health Records in Mental Health Care: Hardly Clairvoyant but Already Useful. JAMA Psychiatry 2019 Apr 01;76(4):349-350. [CrossRef] [Medline]
  22. Weissman MM, Pathak J, Talati A. Personal Life Events-A Promising Dimension for Psychiatry in Electronic Health Records. JAMA Psychiatry 2020 Feb 01;77(2):115-116 [FREE Full text] [CrossRef] [Medline]
  23. Musliner KL, Mortensen PB, McGrath JJ, Suppli NP, Hougaard DM, Bybjerg-Grauholm J, Bipolar Disorder Working Group of the Psychiatric Genomics Consortium. Association of Polygenic Liabilities for Major Depression, Bipolar Disorder, and Schizophrenia With Risk for Depression in the Danish Population. JAMA Psychiatry 2019 May 01;76(5):516-525 [FREE Full text] [CrossRef] [Medline]
  24. Epic Systems Corporation.   URL: https://www.epic.com [accessed 2021-05-25]
  25. Richter M, Storck M, Blitz R, Goltermann J, Seipp J, Dannlowski U, et al. Continuous digital collection of patient-reported outcomes during inpatient treatment for affective disorders - implementation and feasibility. MedRxiv. Preprint posted online on September 1, 2020. [CrossRef]
  26. Geschäftsbericht 2019. UKM.   URL: https:/​/www.​ukm.de/​fileadmin/​ukminternet/​daten/​zentralauftritt/​presse-services/​downloads/​Logbuch/​UKM-Geschaeftsbericht-2019/​index.html#book/​page/​64-65 [accessed 2021-05-25]
  27. Qualitätsbericht 2018. UKM.   URL: https://www.g-ba-qualitaetsberichte.de/api/file/download/260550131-00-2018-xml.pdf [accessed 2021-05-25]
  28. Geschäftsbereich medizinisches management medizincontrolling. UKM.   URL: https://www.ukm.de/index.php?id=mc [accessed 2021-05-25]
  29. Dedalus to acquire Agfa's HCIS business: The signify view. Signify Research.   URL: https://www.signifyresearch.net/digital-health/dedalus-acquire-agfas-hcis-business-signify-view/ [accessed 2021-05-25]
  30. Operational data model (ODM)-XML. CDISC.   URL: https://www.cdisc.org/standards/data-exchange/odm [accessed 2021-05-25]
  31. Define-XML. CDISC.   URL: https://www.cdisc.org/standards/data-exchange/define-xml [accessed 2021-05-25]
  32. Health Level Seven International (HL7).   URL: http://www.hl7.org/ [accessed 2021-05-25]
  33. NextGen Connect (formerly Mirth Connect).   URL: https://www.nextgen.com/ [accessed 2021-05-25]
  34. Java. Oracle.   URL: https://java.com [accessed 2021-05-25]
  35. ECMAScript 6. W3 Schools.   URL: https://www.w3schools.com/js/js_es6.asp [accessed 2021-05-25]
  36. What is TypeScript? TypeScript.   URL: https://www.typescriptlang.org/ [accessed 2021-05-25]
  37. MongoDB Java drivers. MongoDB.   URL: https://mongodb.github.io/mongo-java-driver/ [accessed 2021-05-25]
  38. Json-lib. Sourceforge.   URL: https://sourceforge.net/projects/json-lib/files/json-lib/json-lib-2.4/ [accessed 2021-05-25]
  39. MongoDB.   URL: https://www.mongodb.com/ [accessed 2021-05-25]
  40. Docker.   URL: https://www.docker.com/ [accessed 2021-05-25]
  41. Red Hat.   URL: https://www.redhat.com [accessed 2021-05-25]
  42. SPSS statistics. IBM.   URL: https://www.ibm.com/analytics/spss-statistics-software [accessed 2021-05-25]
  43. Adobe photoshop. Adobe.   URL: https://www.adobe.com/de/products/photoshop.html [accessed 2021-05-25]
  44. Microsoft Visio.   URL: https://products.office.com/de-de/visio/flowchart-software?tab=tabs-1 [accessed 2021-05-25]
  45. Blitz R, Dugas M. Conceptual Design, Implementation, and Evaluation of Generic and Standard-Compliant Data Transfer into Electronic Health Records. Appl Clin Inform 2020 May;11(3):374-386. [CrossRef] [Medline]
  46. Stahl T, Efftinge S, Haase A, Volter M. Modellgetriebene Softwareentwicklung: Techniken, Engineering, Management. Germany: dpunkt.verlag; 2012.
  47. Soto-Rey I, Rehr M, Bruland P, Zeidler C, Riepe C, Steinke S, et al. Electronic Collection of Multilingual Patient-Reported Outcomes across Europe. Methods Inf Med 2018 Dec;57(S 02):e107-e114 [FREE Full text] [CrossRef] [Medline]
  48. Soto-Rey, Dugas M, Storck M. Implementation of an ODM and HL7 Compliant Electronic Patient-Reported Outcome System. Stud Health Technol Inform 2016;228:421-425. [Medline]
  49. Storck M, Dugas-Breit S, Dugas M, Soto-Rey I. MOPAT@HOME: electronic patient reported outcomes filled out at home, evaluated at the hospital. In: Studies in Health Technology and Informatics; Volume 244-The Practice of Patient Centered Care-Empowering and Engaging Patients in the Digital Era. Amsterdam, Netherlands: iOS Press; 2017.
  50. REDCap.   URL: https://www.project-redcap.org/ [accessed 2021-05-25]
  51. Kannan V, Basit M, Youngblood J. Agile co-development for clinical adoption and adaptation of innovative technologies. In: Health Innov Point Care Conf. 2017 Presented at: 2017 IEEE Healthcare Innovations and Point of Care Technologies (HI-POCT); 6-8 Nov 2017; Bethesda. [CrossRef]
  52. Beck AT, Steer RA, Brown GK. Manual for the Beck Depression Inventory-II. San Antonio: APA PsycNet; 1996.
  53. Kühner C, Bürger C, Keller F, Hautzinger M. Reliability and validity of the Revised Beck Depression Inventory (BDI-II). Results from German samples. Nervenarzt 2007 Jun;78(6):651-656. [CrossRef] [Medline]
  54. Soto CJ, John OP. Short and extra-short forms of the Big Five Inventory–2: The BFI-2-S and BFI-2-XS. Journal of Research in Personality 2017 Jun;68:69-81. [CrossRef]
  55. Childhood trauma questionnaire. APA PsycNet. 1994.   URL: https://doi.org/10.1037/t02080-000 [accessed 2021-05-25]
  56. Hamilton M. The Hamilton rating scale for depression. In: Assessment of Depression. Berlin Heidelberg: Springer-Verlag; 1986:143-152.
  57. Leckelt M, et al. Supplemental Material for Validation of the Narcissistic Admiration and Rivalry Questionnaire Short Scale (NARQ-S) in Convenience and Representative Samples. Psychological Assessment 2017 Mar 2:86-96. [CrossRef]
  58. The SCL-90-R and brief symptom inventory, and matching clinical rating scales. APA PsycNet.   URL: https://psycnet.apa.org/record/1999-02767-022 [accessed 2021-05-25]
  59. JavaScript Object Notation.   URL: https://www.json.org/json-en.html [accessed 2021-05-25]
  60. JavaScript operational data model. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/jodm [accessed 2021-05-25]
  61. JavaScript Object Notation Schema.   URL: https://json-schema.org/ [accessed 2021-05-25]
  62. Bargas-Avila J, Brenzikofer O, Roth S, User A. Simple but crucial user interfaces in the world wide web: introducing 20 guidelines for usable web form design. IntechOpen. 2010.   URL: https:/​/www.​intechopen.com/​books/​user-interfaces/​simple-but-crucial-user-interfaces-in-the-world-wide-web-introducing-20-guidelines-for-usable-web-fo [accessed 2021-05-28]
  63. Idrus Z, Razak N. Using three layer model (TLM) in web form design: WeFDeC checklist development. In: ICCEA NTAA. 2010 Presented at: 2010 Second International Conference on Computer Engineering and Applications; 19-21 March 2010; Bali. [CrossRef]
  64. Research-based web design and usability guidelines. U.S. Department of Health & Human Services. 2010.   URL: https://www.usability.gov/sites/default/files/documents/guidelines_book.pdf [accessed 2021-05-28]
  65. Seckler M, Heinz S, Bargas-Avila J. Designing usable web forms: Empirical evaluation of web form improvement guidelines. New York, USA: ACM; 2014 Presented at: Proceedings of the 2014 Annual Conference on Human Factors in Computing Systems; 2014; Toronto. [CrossRef]
  66. Beck depression inventory. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/27300 [accessed 2021-05-25]
  67. Big five inventory (BFI-2-S) Big five - SEED studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41674 [accessed 2021-05-25]
  68. Big five inventory (BFI-2-XS) - SEED studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41673 [accessed 2021-05-25]
  69. Childhood trauma questionnaire (CTQ). Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/9946 [accessed 2021-05-25]
  70. Family mental history - SEED-studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41444 [accessed 2021-05-25]
  71. Worboys M. The Hamilton Rating Scale for Depression: The making of a "gold standard" and the unmaking of a chronic illness, 1960-1980. Chronic Illn 2013 Sep;9(3):202-219 [FREE Full text] [CrossRef] [Medline]
  72. NARQ-S. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41440 [accessed 2021-05-25]
  73. SCL-90 somatisation scale - SEED-studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41671 [accessed 2021-05-25]
  74. Sociodemographic questionnaire - SEED-studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41441 [accessed 2021-05-25]
  75. Questions on individual disease course - SEED-studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41442 [accessed 2021-05-25]
  76. Questions on somatic comorbidities - SEED-studie. Portal für Medizinische Datenmodelle (MDM-Portal).   URL: https://medical-data-models.org/41443 [accessed 2021-05-25]
  77. Seed clinical data admission date/time. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000037.xml [accessed 2021-05-25]
  78. Seed clinical data classification. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000036.xml [accessed 2021-05-25]
  79. Seed clinical data DRG/diagnosis. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000032.xml [accessed 2021-05-25]
  80. Seed clinical data ECT. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000035.xml [accessed 2021-05-25]
  81. Seed Clinical data laboratory assessments. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000031.xml [accessed 2021-05-25]
  82. Seed clinical data medication. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000052.xml [accessed 2021-05-25]
  83. Seed clinical data patient. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000039.xml [accessed 2021-05-25]
  84. Seed clinical data vital signs. GitHub.   URL: https://github.com/764374625489/Seed/blob/main/datamodels/odm/2020_0000053.xml [accessed 2021-05-25]
  85. Kim E, Rubinstein SM, Nead KT, Wojcieszynski AP, Gabriel PE, Warner JL. The Evolving Use of Electronic Health Records (EHR) for Research. Semin Radiat Oncol 2019 Oct;29(4):354-361. [CrossRef] [Medline]
  86. Skyttberg N, Chen R, Koch S. Man vs machine in emergency medicine - a study on the effects of manual and automatic vital sign documentation on data quality and perceived workload, using observational paired sample data and questionnaires. BMC Emerg Med 2018 Dec 13;18(1):54 [FREE Full text] [CrossRef] [Medline]
  87. Butler A, Wei W, Yuan C, Kang T, Si Y, Weng C. The Data Gap in the EHR for Clinical Research Eligibility Screening. AMIA Jt Summits Transl Sci Proc 2018;2017:320-329 [FREE Full text] [Medline]
  88. FDA resources for data standards. US Food and Drug Administration.   URL: https://www.fda.gov/industry/fda-resources-data-standards [accessed 2021-05-25]
  89. Genomics England uses MongoDB to power the data science behind the 100,000 genomes project. MongoDB.   URL: https:/​/www.​mongodb.com/​press/​genomics-england-uses-mongodb-to-power-the-data-science-behind-the-100000-genomes-project [accessed 2021-05-25]
  90. Esteva A, Kuprel B, Novoa RA, Ko J, Swetter SM, Blau HM, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature 2017 Jan 25;542(7639):115-118. [CrossRef]
  91. Kambeitz J, Cabral C, Sacchet MD, Gotlib IH, Zahn R, Serpa MH, et al. Detecting Neuroimaging Biomarkers for Depression: A Meta-analysis of Multivariate Pattern Recognition Studies. Biol Psychiatry 2017 Sep 01;82(5):330-338. [CrossRef] [Medline]
  92. Cearns M, Opel N, Clark S, Kaehler C, Thalamuthu A, Heindel W, et al. Predicting rehospitalization within 2 years of initial patient admission for a major depressive episode: a multimodal machine learning approach. Transl Psychiatry 2019 Nov 11;9(1):285 [FREE Full text] [CrossRef] [Medline]
  93. Koutsouleris N, Dwyer DB, Degenhardt F, Maj C, Urquijo-Castro MF, Sanfelici R, PRONIA Consortium. Multimodal Machine Learning Workflows for Prediction of Psychosis in Patients With Clinical High-Risk Syndromes and Recent-Onset Depression. JAMA Psychiatry 2021 Feb 01;78(2):195-209 [FREE Full text] [CrossRef] [Medline]
  94. Khalilia M, Choi M, Henderson A, Iyengar S, Braunstein M, Sun J. Clinical Predictive Modeling Development and Deployment through FHIR Web Services. AMIA Annu Symp Proc 2015;2015:717-726 [FREE Full text] [Medline]
  95. OMOP data model. Observational Medical Outcomes Partnership.   URL: https://www.ohdsi.org/data-standardization/the-common-data-model/ [accessed 2021-05-25]
  96. Fusar-Poli P, Oliver D, Spada G, Patel R, Stewart R, Dobson R, et al. Real World Implementation of a Transdiagnostic Risk Calculator for the Automatic Detection of Individuals at Risk of Psychosis in Clinical Routine: Study Protocol. Front Psychiatry 2019;10:109 [FREE Full text] [CrossRef] [Medline]
  97. NIH common data elements repository. National Library of Medicine.   URL: https://cde.nlm.nih.gov/ [accessed 2021-05-25]
  98. Systematized Nomenclature of Medicine.   URL: https://www.snomed.org/ [accessed 2021-05-25]
  99. Unified Medical Language System.   URL: https://www.nlm.nih.gov/research/umls/index.html [accessed 2021-05-25]
  100. Seed project. GitHub.   URL: https://github.com/764374625489/Seed [accessed 2021-05-25]
  101. Hallensleben N, Glaesmer H, Forkmann T, Rath D, Strauss M, Kersting A, et al. Predicting suicidal ideation by interpersonal variables, hopelessness and depression in real-time. An ecological momentary assessment study in psychiatric inpatients with depression. Eur Psychiatry 2019 Feb;56:43-50. [CrossRef] [Medline]


CDISC: Clinical Data Interchange Standards Consortium
EHR: electronic health record
FHIR: Fast Healthcare Interoperability Resources
HL7: Health Level 7
MoPat: Mobile Patient Survey
ODM: operational data model
SMA:T: Single-source Metadata ARchitecture Transformation


Edited by J Torous; submitted 21.12.20; peer-reviewed by X Jing, A Sule; comments to author 25.01.21; revised version received 14.03.21; accepted 02.04.21; published 09.06.21

Copyright

©Rogério Blitz, Michael Storck, Bernhard T Baune, Martin Dugas, Nils Opel. Originally published in JMIR Mental Health (https://mental.jmir.org), 09.06.2021.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Mental Health, is properly cited. The complete bibliographic information, a link to the original publication on https://mental.jmir.org/, as well as this copyright and license information must be included.