Roman Erick Emde

IT Specialist / Data Scientist

About Me

Experienced IT specialist with extensive expertise in programming, systems administration, and data analytics. Proven track record of implementing solutions that improve data reliability and accessibility. Strong background in version control systems, data quality assurance, and cloud technologies. Currently pursuing an MS in Analytics (Computational Data Analytics Track) at Georgia Tech to further enhance technical capabilities and analytical skills.

Professional Experience

IT Specialist (Project Management)

Centers for Disease Control and Prevention (CDC)

National Center for Injury Prevention and Control, Division of Injury Prevention
Data Analytics Branch, Programming and Web Applications Team

December 2018 - Present

Lead programmer and systems administrator for the Web-based Injury Statistics Query and Reporting System (WISQARS, https://wisqars.cdc.gov/), a national data repository for injury statistics serving thousands of public health professionals, researchers, and policy makers. Responsible for ensuring system reliability, data quality, and implementing new features to enhance data accessibility and visualization capabilities.

Key Accomplishments:

  • Implemented a Git/DVC hybrid version control system, enabling efficient management of over 500GB of versioned data across cloud platforms (AWS S3 and Azure blob storage), reducing deployment errors by 95%.
  • Developed Python automation scripts that reduced data validation time from 3 weeks to 3 days for annual data releases, ensuring 99.9% data accuracy compared to source databases.
  • Created a machine learning solution that automatically validates narrative field consistency in 500,000+ annual NEISS-AIP injury records, identifying discrepancies that manual review had previously missed.
  • Established a biweekly code deployment pipeline using Jenkins, successfully executing over 50 production deployments per year with zero critical failures.
  • Led development of the WISQARS Community Health Factors tool, enabling comparative analysis of injury data across social vulnerability indices for all 3,142 U.S. counties.

Information Systems Specialist

Centers for Disease Control and Prevention (CDC)

Human Resources IT Department

February 2011 - December 2018

Served as Learning Portal Project Lead for CDC Corporate University Division, managing the agency-wide learning management system that served over 15,000 CDC employees and contractors. Responsible for system administration, integration with HR systems, and developing custom applications to enhance learning delivery.

Key Accomplishments:

  • Successfully led the selection and implementation of the Saba learning management system, migrating 7,500+ learning records and 300+ courses with zero data loss.
  • Developed 5 custom SharePoint applications including a teacher expertise database that connected 300+ internal subject matter experts with training needs across the agency.
  • Created and maintained documentation and training materials that reduced help desk tickets by 40% and enabled successful knowledge transfer to 25+ LMS administrators.
  • Established integration between HR systems and the LMS, automating record updates for 1,000+ monthly training completions.
  • Provided technical consulting to 12 CDC programs implementing their own e-learning solutions, resulting in standardized approaches across the agency.

Instructional Technologist

CDC Information Technology Support Contract (CITS) - Northrop Grumman IS

July 2006 - February 2011

Provided technical consulting services for CDC's learning technology systems, focusing on LMS implementation, administration, and support. Collaborated with stakeholders to define requirements, evaluate solutions, and implement learning technologies.

Key Accomplishments:

  • Contributed to requirements gathering and selection process for agency-wide LMS, evaluating 8 potential solutions against 150+ functional requirements.
  • Trained over 200 staff members on LMS functionality, resulting in 95% user satisfaction ratings.
  • Developed technical documentation for 3 major LMS upgrades, ensuring smooth transitions with minimal service disruption.

Featured Projects

WISQARS Community Health Factors tool showing a map of the United States with fatal injury rates and social vulnerability index

WISQARS Community Health Factors

Interactive tool enabling comparative analysis of injury data across social vulnerability indices for all U.S. counties.

Economic Cost of Opioid Use Disorder and Fatal Overdose dashboard showing a map of the United States with economic impact by state

Opioid Economic Burden Dashboard

Visualization tool displaying the economic impact of opioid-related injuries and fatalities across the United States.

Diagram: Git/DVC hybrid version control workflow. Changes made in the local development environment are committed and pushed (code via Git, data via DVC) to cloud storage on AWS S3 and Azure Blob, then deployed to the DEV, TEST, and PROD environments. Panel captions: "Tracking 500GB+ of large data files across environments" and "Automated Deployment Pipeline: seamlessly deploy code and data to test, stage, and production environments."

Git/DVC Hybrid Version Control System

Custom implementation enabling efficient management of 500GB+ of versioned data across AWS S3 and Azure blob storage.

Implemented a version control solution that combines Git and Data Version Control (DVC) to manage over 500GB of data assets while keeping code and data under consistent version control. This hybrid system addresses a critical challenge in large-scale data management: tracking data files that are too large for Git to version efficiently on its own.

The workflow begins in the local development environment, where code changes are tracked by Git while large data files are managed by DVC. When changes are committed, DVC automatically pushes data files to optimized cloud storage (either AWS S3 or Azure blob storage) and maintains references in the Git repository.
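
As a rough illustration of the workflow described above, the following minimal sketch builds the sequence of Git and DVC commands for one versioned change. It is not the production WISQARS tooling: the helper names plan_release and run, the file path, and the remote name are hypothetical, and the sketch assumes a DVC remote has already been configured for the repository. Only standard dvc and git subcommands are used.

```python
import shlex
import subprocess
from pathlib import PurePosixPath
from typing import List


def plan_release(data_path: str, message: str, dvc_remote: str) -> List[List[str]]:
    """Return the Git and DVC commands for one versioned change, in order.

    All arguments are illustrative; none come from the actual project.
    """
    # `dvc add` writes a small pointer file next to the data file and
    # adds the data file itself to a .gitignore in the same directory.
    pointer = f"{data_path}.dvc"
    ignore = str(PurePosixPath(data_path).parent / ".gitignore")
    return [
        ["dvc", "add", data_path],           # hand the large file to DVC
        ["git", "add", pointer, ignore],     # Git tracks only the pointer
        ["git", "commit", "-m", message],    # record code and data reference
        ["dvc", "push", "-r", dvc_remote],   # upload data to cloud storage
        ["git", "push"],                     # publish the commit
    ]


def run(commands: List[List[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Dry run with a hypothetical path and remote name: prints the commands only.
    run(plan_release("data/example.csv", "Update example data", "s3remote"))
```

Pushing the data before the code is a deliberate choice in this sketch: a commit that reaches the shared Git repository then never points at data that is missing from cloud storage.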

Key Technical Achievements:

  • Engineered a seamless integration between Git and DVC, creating a unified version control experience for all project assets
  • Implemented automated pipelines for deploying both code and data to test, stage, and production environments
  • Reduced deployment errors by 95% through consistent versioning of both code and data assets
  • Enabled efficient team collaboration on large datasets without compromising version integrity
  • Created custom automation scripts that optimize cloud storage usage while maintaining full version history
  • Established rollback procedures for rapid recovery from data-related incidents

This system has become a critical infrastructure component for the WISQARS platform, ensuring data consistency across environments while supporting the platform's continuous evolution and expansion.

Technical Skills

Programming Languages

Python Advanced
SQL Advanced
SAS Intermediate
JavaScript/Node.js Intermediate
R Basic

Cloud & DevOps

Amazon Web Services (AWS)
Azure
Git/GitHub
DVC
Jenkins

Data Science & Analytics

Machine Learning
Natural Language Processing
Data Visualization
Statistical Analysis

Database Technologies

MS SQL Server
SAS data files
MySQL

Education

Master of Science in Analytics, Computational Data Analytics Track

Georgia Institute of Technology, Atlanta, GA

Expected 2026

Relevant Coursework: Computing for Data Analysis (CSE 6040), Introduction to Analytics Modeling (ISYE 6501), Computational Data Analysis: Learning, Mining, and Computation (ISYE 6740), Applied Natural Language Processing (CSE 8803)

Master of Music, Music Theory

Georgia State University, Atlanta, GA

Relevant Focus: Data Analysis with Technology, Multimedia Presentation

Bachelor of Arts in Music

Elmhurst College, Elmhurst, IL

Awards & Recognition

Time Off Award: Technical Lead Role in Identifying Data Discrepancies

2023

  • Led technical effort to identify and resolve data discrepancies in the WISQARS Community Health Factors tool
  • Developed Python scripts for automated comparison of data sources
  • Contributed to the successful public release of the Health Equity module

Special Act or Service Award: Providing Provisional Injury Data

2022

  • Developed automated system to query, validate, and present provisional mortality data
  • Created Python scripts to enhance data quality control processes
  • Established templates for future additions to the WISQARS platform

Special Act Award: Leadership on WISQARS Opioid Economic Burden Dashboard

2021

  • Provided leadership and technical ownership for the WISQARS Opioid Economic Burden Dashboard
  • Successfully managed technical completion on schedule
  • Collaborated with multiple stakeholders to resolve implementation challenges

Professional Affiliations

Villa International - Board of Directors

Serving on the board of a nonprofit organization that provides affordable, short-term housing for international visitors to Atlanta, primarily researchers at the CDC and Emory University. Villa International has hosted over 27,000 residents from 179 countries since 1972.

Scout Troop 477, Dunwoody, GA - Board Member & Outdoor Chair

Serving on the troop committee as Outdoor Chair, responsible for planning and coordinating monthly scout trips. Duties include making reservations for campsites and activities, recruiting adult leadership for outdoor adventures, and ensuring all adult leaders receive appropriate training for the trips they will be leading.

Professional Memberships

  • CDC Python Users Group - Member
  • CDC R Users Group - Member
  • CDC Section 508 for Websites Workgroup - Member
  • CDC DataViz SAG - Member
  • NCIPC Web Developers Workgroup - Member

Additional Achievements

Eagle Scout Award, Boy Scouts of America

  • Highest achievement and rank in the Boy Scouts of America
  • Demonstrates leadership, service, and commitment to community

Languages

English Native
German Advanced speaking, intermediate writing and reading