Development of a web service for analysis in a distributed network.
Abstract
OBJECTIVE: We describe functional specifications and practicalities in the software
development process for a web service that allows the construction of the multivariate
logistic regression model, Grid Logistic Regression (GLORE), by aggregating partial
estimates from distributed sites, with no exchange of patient-level data. BACKGROUND:
We recently developed and published a web service for model construction and data
analysis in a distributed environment. This recent paper provided an overview of the
system that is useful for users, but included very few details that are relevant for
biomedical informatics developers or network security personnel who may be interested
in implementing this or similar systems. We focus here on how the system was conceived
and implemented. METHODS: We followed a two-stage development approach by first implementing
the backbone system and incrementally improving the user experience through interactions
with potential users during the development. Our system went through various stages
such as concept proof, algorithm validation, user interface development, and system
testing. We used the Zoho Project management system to track tasks and milestones.
We leveraged Google Code and Apache Subversion to share code among team members, and
developed an applet-servlet architecture to support the cross platform deployment.
DISCUSSION: During the development process, we encountered challenges such as Information
Technology (IT) infrastructure gaps and limited team experience in user-interface
design. We figured out solutions as well as enabling factors to support the translation
of an innovative privacy-preserving, distributed modeling technology into a working
prototype. CONCLUSION: Using GLORE (a distributed model that we developed earlier)
as a pilot example, we demonstrated the feasibility of building and integrating distributed
modeling technology into a usable framework that can support privacy-preserving, distributed
data analysis among researchers at geographically dispersed institutes.
Type
Journal articlePermalink
https://hdl.handle.net/10161/15405Published Version (Please cite this version)
10.13063/2327-9214.1053Publication Info
Jiang, X; Wu, Y; Marsolo, K; & Ohno-Machado, L (2014). Development of a web service for analysis in a distributed network. EGEMS (Wash DC), 2(1). pp. 1053. 10.13063/2327-9214.1053. Retrieved from https://hdl.handle.net/10161/15405.This is constructed from limited available data and may be imprecise. To cite this
article, please review & use the official citation provided by the journal.
Collections
More Info
Show full item recordScholars@Duke
Keith Allen Marsolo
Associate Professor in Population Health Sciences
Dr. Marsolo is a faculty member in the Department of Population Health Sciences (DPHS)
and a member of the Duke Clinical Research Institute (DCRI). His current research focuses
on infrastructure to support the use of electronic health records (EHRs) and other
real-world data sources in observational and comparative effectiveness research and
public health surveillance, as well as standards and architectures for multi-center
learning health systems
Yuan Wu
Associate Professor in Biostatistics & Bioinformatics
Survival analysis, Sequential clinical trial design, Machine learning, Causal inference,
Non/Semi-parametric method, Statistical computing
Alphabetical list of authors with Scholars@Duke profiles.

Articles written by Duke faculty are made available through the campus open access policy. For more information see: Duke Open Access Policy
Rights for Collection: Scholarly Articles
Works are deposited here by their authors, and represent their research and opinions, not that of Duke University. Some materials and descriptions may include offensive content. More info