About me

I’m a data engineer/software engineer with a Ph.D. and over 5 years’ experience developing and optimizing web applications and database management systems. I recently earned a Springboard certificate in Data Engineering, which strengthened my skills in Python, SQL, Data Warehousing, Azure, Apache Airflow, Apache Kafka, Docker, and Kubernetes. I also currently use my knowledge and skills to help college students with computer science subjects (Database Management, C++, Java, and Python) as a part-time online tutor.

Resume

Education

Experience

tutor.com United States (remote) (Sep 2020 - present)

Online Tutor

  • Working in an online one-to-one mentoring program that helps college and K-12 students with Database Management Systems, C++, Java, and Python programming.

Springboard United States (remote) (Sep 2020 - Mar 2021)

Data Engineer Fellow

  • Completed a 450+ hour industry-driven curriculum with weekly 1:1 sessions with an industry expert mentor, building skills in Python, SQL, Azure, Apache Airflow, Apache Kafka, Kafka Streaming API, Docker, and Kubernetes.

VLDB laboratory, Sungkyunkwan University, Suwon, Korea (Aug 2019 - Oct 2019)

Postdoctoral Researcher

  • Researched and implemented a partition REDO logging architecture that exploits NVDIMM to improve MySQL/InnoDB throughput by up to 2.8 times and reduce recovery time by up to 11 times in OLTP workloads.
  • Referee: Professor Sang-Won Lee

VLDB laboratory, Sungkyunkwan University, Suwon, Korea (Sep 2014 - Aug 2019)

Ph.D. Researcher

  • First author of two SCI journal papers ([SCIE paper1], [SCIE paper2]) and two conference papers ([conf. paper1], [conf. paper2]).
  • Designed and implemented a novel partition buffer that improves the throughput of InnoDB and WiredTiger by up to two times. [paper]
  • Used multi-stream SSDs to optimize WiredTiger, improving throughput by up to 65%. [SCIE paper]
  • Advisor: Professor Sang-Won Lee

DB laboratory, Kyungpook National University, Daegu, Korea (Mar 2012 - Feb 2014)

Research Assistant

  • Developed a front-end Android application and a back-end server for matching stock price patterns using a subsequence matching algorithm.
  • Researched and evaluated graph databases (e.g., Neo4j, GraphBase).
  • Advisor: Professor Wook-Shin Han

Skills

  • Tools and Technologies: Python, C/C++, Java, SQL, MySQL, Microsoft Azure, Docker, HDFS, Apache Spark, Apache Airflow, Tableau, Hadoop, Git.
  • Industry Knowledge: Object-oriented Programming (OOP), Big Data, Data Warehousing, Cloud Computing, Software Project Management, ETL (Extract, Transform, Load), Data Modeling, Business Analysis.