I am currently working at Amazon as a Data Engineer. I started programming since high school, and I am proficient in Python, R and SQL and data skills in PySpark, Hive and AWS cloud computing.
PhD in Chemistry, spec. Quantitative Biology, 2019
Brandeis University
BSc in Materials Chemistry, 2012
Nankai University
BSc in Finance, 2012
Nankai University
Python, Django, SQL, Java, git, Bash, R, Matlab, Lisp, Javascript
IAM, EC2, Redshift, Glue, Athena, EMR, Lambda, S3, RDS, DynamoDB
Hadoop, Hive, Spark, Kafka, Airflow
I setup Postgresql as database for Django and it works as a charm. However, today I got a error message when I tried to migrate my …
This is my implementaion on the Django tutorial.
The org-mode code blocks can be used for literate programming and creating executable snippets. There is also a quick way to insert …
由于我习惯写中文博客,所以将写博客这件事也转移到 Emacs 后,我渐渐感觉到 pyim 的不足。所以今天研究一下如何让 pyim 调用 Rime 的词库。
I am currently (still) seeking a job in data/software engineering area, and I am preparing for all kinds of technical interviews, …
• Provide smooth transition from accounting app to plain text accounting tools.
• Convert .csv
file exported from Sui accounting app …
• Extracted cancer research data from the Cancer Genome Atlas Network ®.
• Applied GISTIC clustering analysis to patient-indexed …
• Write a class to handle multithreading website crawling inside the given domain.
• Feature a breath-first search algorithm and a …
• Provide a personalized websites collection and naviation page.
• Built with Javascript, CSS and npm.
• Developed an ETL pipeline to extract, integrate and transform prescription data from multiple providers in AWS cloud computing to …