Carol Zhang

Skills

Rust, Python, Spark, SQL, AWS, and some of their friends

Writing

Work

  1. 01/2020 - present: Database Developer at Era Software (acquired by ServiceNow)
    • Part of a team working on a logging database written in Rust.
    • Performance: Created automated benchmarks for query and ingest speed over three months of mostly independent work. Team members consistently used these graphs to verify if and how much their feature has affected performance. Product and customer teams have also used these for planning and external communication. Collected test cases and ran performance comparisons between our database and that of an external vendor’s as part of the acquisition process; proving we can achieve better query and ingest throughput while saving the parent company ∼50,000 USD per customer per year in licensing costs.
    • Testing: Build fuzzers to efficiently identify problematic inputs notably for a new text search crate. Two edge cases were found during the first run and patched the same week. Set up continuous monitoring and alerting in staging and production environments for critical operations; notifying my team and teams responsible for upstream services of potentially customer-facing incidents.
  1. 05/2018-01/2021: Senior Data Scientist at AdGear (acquired by Samsung)
    • Data infrastructure: Initiated, researched, and led efforts to replace existing infrastructure with Apache Airflow. This made sure workflows powering key business intelligence reports were organized, reliable, and easy to debug. Also worked with the data engineering team to build and maintain multiple cross-platform ETL pipelines.
    • Data science: Leveraged terabytes of smart TV usage data to answer product and business questions such as whether to expand into specific markets. Collaborated with machine learning and back-end teams to design and add context to automated ad bidding models; used existing data and conducted user interviews with ad campaign managers.
  1. 08/2016-05/2018: Software Developer
    • Build data infrastructure generating custom retention insights. Details
    • Use data analysis and visualization to answer business questions
  1. 2017-2018: Open Source Transportation Data Analyst
    • Improve public transit data transparency
    • Process and analyze subway delay times using Python
  1. 2015-2016: Quantitative Neuroscience Researcher
  1. 2012-2016: BSc. McGill University
    • Physics and Physiology Joint Major
    • Relevant coursework: Differential Equations, Data Structures and Algorithms, Database Systems, Nonlinear Dynamics and Chaos, Integrative Neuroscience, Mathematical Modelling, Biomedical Signals and Systems

Vancouver

  1. '12: Rick Hansen 25th Anniversary Relay Award
Not loitering

✌ Coded with love. ©