File Download

  • Find it @ UNIST can give you direct access to the published full text of this article. (UNISTARs only)

Views & Downloads

Detailed Information

Cited time in webofscience Cited time in scopus
Metadata Downloads

Full metadata record

DC Field Value Language
dc.citation.endPage 426 -
dc.citation.startPage 403 -
dc.citation.title INFORMATION SCIENCES -
dc.citation.volume 568 -
dc.contributor.author Lee, Taek-Ho -
dc.contributor.author Lee, Junghye -
dc.contributor.author Jun, Chi-Hyuck -
dc.date.accessioned 2023-12-21T15:37:36Z -
dc.date.available 2023-12-21T15:37:36Z -
dc.date.created 2021-04-19 -
dc.date.issued 2021-08 -
dc.description.abstract Sharing electronic health record data is essential for advanced analysis, but may put sensitive information at risk. Several studies have attempted to address this risk using contextual embedding, but with many hospitals involved, they are often inefficient and inflexible. Thus, we propose a bilingual autoencoder-based model to harmonize local embeddings in different spaces. Cross-hospital reconstruction of embeddings makes encoders map embeddings from hospitals to a shared space and align them spontaneously. We also suggest two-phase training to prevent distortion of embeddings during harmonization with hospitals that have biased information. In experiments, we used medical event sequences from the Medical Information Mart for Intensive Care-III dataset and simulated the situation of multiple hospitals. For evaluation, we measured the alignment of events from different hospitals and the prediction accuracy of a patient & rsquo;s diagnosis in the next admission in three scenarios in which local embeddings do not work. The proposed method efficiently harmonizes embeddings in different spaces, increases prediction accuracy, and gives flexibility to include new hospitals, so is superior to previous methods in most cases. It will be useful in predictive tasks to utilize distributed data while preserving private information. -
dc.identifier.bibliographicCitation INFORMATION SCIENCES, v.568, pp.403 - 426 -
dc.identifier.doi 10.1016/j.ins.2021.03.064 -
dc.identifier.issn 0020-0255 -
dc.identifier.scopusid 2-s2.0-85104930760 -
dc.identifier.uri https://scholarworks.unist.ac.kr/handle/201301/52735 -
dc.identifier.url https://www.sciencedirect.com/science/article/pii/S0020025521003157?via%3Dihub -
dc.identifier.wosid 000657236800004 -
dc.language 영어 -
dc.publisher ELSEVIER SCIENCE INC -
dc.title Bilingual Autoencoder-based Efficient Harmonization of Multi-source Private Data for Accurate Predictive Modeling -
dc.type Article -
dc.description.isOpenAccess TRUE -
dc.relation.journalWebOfScienceCategory Computer Science, Information Systems -
dc.relation.journalResearchArea Computer Science -
dc.type.docType Article -
dc.description.journalRegisteredClass scie -
dc.description.journalRegisteredClass scopus -
dc.subject.keywordAuthor Distributed EHR -
dc.subject.keywordAuthor Contextual embedding -
dc.subject.keywordAuthor Space alignment -
dc.subject.keywordAuthor Autoencoder -
dc.subject.keywordAuthor Predictive tasks -
dc.subject.keywordPlus REGRESSION -
dc.subject.keywordPlus SECURE -

qrcode

Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.