详细信息
Supporting Regularized Logistic Regression Privately and Efficiently ( SCI-EXPANDED收录)
文献类型:期刊文献
英文题名:Supporting Regularized Logistic Regression Privately and Efficiently
作者:Li, Wenfa[1];Liu, Hongzhe[1];Yang, Peng[1];Xie, Wei[2]
通讯作者:Xie, W[1]
机构:[1]Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing 100101, Peoples R China;[2]Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37232 USA
第一机构:北京联合大学北京市信息服务工程重点实验室
通讯机构:[1]corresponding author), Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37232 USA.
年份:2016
卷号:11
期号:6
外文期刊名:PLOS ONE
收录:;Scopus(收录号:2-s2.0-84974662989);WOS:【SCI-EXPANDED(收录号:WOS:000377560200010)】;
基金:This work was partly supported by The National Nature Science Foundation of China (No. 61300078), and The Importation and Development of High-Caliber Talents Project of Beijing Municipal Institutions (CIT&TCD20130320, CIT&TCD201504039), and Funding Project for Academic Human Resources Development in Beijing Union University (Zk80201403, Rk100201510). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
语种:英文
摘要:As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy- enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc.
参考文献:
正在载入数据...