An R Script for Assessment of Data Quality in the BioSense Locker Database

How to Cite

Rezny, S., & Hoferka, S. (2016). An R Script for Assessment of Data Quality in the BioSense Locker Database. Online Journal of Public Health Informatics, 8(1).


Syndromic surveillance requires reliable, accurate, and complete healthcare encounter data. To address the need for quality assessment of ED data, we developed an R script to assess and produce reports on data quality in the BioSense locker database. The script examines identifying variables in the HL7 messages from the locker, aggregates messages into ED visits based on these identifiers, processes the aggregated data to calculate metadata for each visit, and computes various data quality metrics. Facility-level reports are written to HTML files, which can then be shared with hospitals and vendors to support ongoing data quality improvements.
Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes. Share-alike: when posting copies or adaptations of the work, release the work under the same license as the original. For any other use of articles, please contact the copyright owner. The journal/publisher is not responsible for subsequent uses of the work, including uses infringing the above license. It is the author's responsibility to bring an infringement action if so desired by the author.