About Me

The About Me page has been rewrited! Now it uses tabs to collapse the content… to save space from being wasted too often~

Personal Profile

Self-proclaimed: Silen Wang / Sylens Wong, is a rosted egg that is gradually becoming happier!
Interests: Playing games, playing games, and still damn playing! Games! Also watch anime, ~~but recently I’m less and less interested in series…~~, In 2024, I started chasing series again! ダンジョン飯 is really amazing! Go watch it!
Learning/Working Path: Public Health -> Molecular Epidemiology -> Bioinformatics -> Neoantigens -> Organoids -> All kinds of things
- emmmm… becoming more and more eclectic
- Word cloud updateded! But not as much differenct as I had imagined …

GitHub:

Education

2009.09-2014.06

School: Guangdong Pharmaceutical University (formerly Guangdong Pharmaceutical College)
College: School of Public Health
Major: Preventive Medicine
Degree: Bachelor

2014.09-2017.06

School: Southern Medical University
College: School of Public Health
Major: Epidemiology and Health Statistics
Degree: Master

Work Experience

Employment: Bioinformatics Analysis Engineer at a bioinformatics service company in Tianjin
Work Content:
1. Human genomic next-generation sequencing data (WES / WGS) quality control and analysis
  - Variant result screening based on frequency, gene region, pathogenicity software prediction, and family inheritance patterns
  - Localization of potentially harmful variants based on genome-wide association study (GWAS) results
  - Customization of screening plans and presentation of results according to customer requirements
2. Research on the current status of specific diseases
  - Research and learn the basic knowledge and current research status of specific diseases to prepare for the development of related analysis modules
3. Maintenance and upgrade of data delivery processes

Employment: Bioinformatics Engineer at a genetic technology company in Hangzhou
Work Content:
- Data Processing
  1. NGS data processing of cfDNA/FFPE/white blood cell samples
  2. Construction and testing of NGS analysis processes
  3. Development of tools for NGS analysis (Python / R / Julia / Shell) and quality control of cell sample sequencing data
  4. Alignment and quality control of raw sequencing data (fastq)
  5. Processing of UMI data / Molecular Barcode data
- Analysis Process Related
  1. Debugging of existing bioinformatics analysis processes
  2. Writing new analysis modules and integrating them into the existing pipeline
  3. Testing and debugging of analysis modules
  4. Deployment and dependency management of analysis software
  5. Performance testing of analysis software and comparison of analysis results
  6. Writing and maintaining process documentation
  7. Construction and testing of single-cell sequencing analysis processes
- Algorithm and Tool Development:
  1. Comparison and replication of UMI deduplication algorithms
  2. Development of tools for drawing quality control plots
  3. Development of variant result screening tools based on ML
  4. Performance optimization of existing tools (parallelization)

2019.07-2020.03 Bioinformatics Engineer at a biotechnology company in Hangzhou
1. Research on NGS-based cfDNA analysis plans, construction of analysis workflows
2. Statistical analysis of neoantigen-related data
3. Maintenance of bioinformatics analysis programs
4. Maintenance of the medication guidance report database
2020.03-2024.04 Bioinformatics Supervisor at a biotechnology company in Hangzhou
1. Development of NGS-based HLA typing/quantification modules
2. Backend development of bioinformatics analysis tools based on flask
3. Development of neoantigen screening modules
4. Maintenance and upgrade of the main bioinformatics analysis workflow
5. Writing and revising materials as required

2024.05-Present Senior Bioinformatics Analysis Engineer at a medical technology company in Hangzhou
1. Development of WES/RNA-Seq/single‑cell analysis workflows
2. Research, interpretation, formulation and execution of bioinformatics analysis plans according to customer requirements
3. Construction, operation and maintenance of bioinformatics databases
4. Operation and maintenance of bioinformatics servers
5. Operation and maintenance of websites and system cloud servers
6. Development, operation, maintenance, migration and automated testing of websites and internal laboratory management systems
7. Operation and maintenance of WeChat mini‑programs

Commonly Used Languages & Tools

Programming Languages

Data Science / Database

Web

IDE / Dev

DevOps

Visualization

Bioinformatics

Operating Systems

Programming Skills

Parallelization: Multiprocess
Next‑generation sequencing file handling: Pysam, PyVCF, CyVCF, Biopython, Scanpy
Web scraping:
- scrapy: Several small‑scale scraping project experiences, capable of using scrapy for static and non‑encrypted dynamic page scraping and data extraction
- Able to use splash, selenium with scrapy to scrape dynamic page data
Data processing / statistical computing: Pandas, numpy, math, scipy, statsmodels
Automated testing: Pytest
Plotting: Plotly, altair
Interactive data display: Dash
Image processing: pillow
WebUI/API development: FastAPI, Taipy, flask, gradio, nicegui
Office automation: docxtpl, openpyxl, python‑docx

Data processing: base, dplyr
Parallelization: Parallel
Plotting: ggplot2, ggpubr, ggthemes
Web development: Shiny, plumber

Linux system installation/configuration management: Red‑hat family distribution operation/maintenance, Debian family distribution installation/configuration/maintenance, Arch family distribution installation/configuration/maintenance
HPC cluster management system: SUN Grid Engine use and maintenance experience
Snakemake: Proficient in writing Snakefiles, proficient in using Snakemake for workflow construction and management, rich practical Snakemake experience
Nextflow: Able to understand nextflow configuration files, experience in converting nextflow workflows to Snakemake workflows
Container technology:
- Containerized development experience, able to configure development containers compliant with Devcontainer specifications for local or remote development
- Rich container usage and deployment experience, able to use mainstream container software such as Docker, Podman, Apptainer (formerly Singularity), able to write Dockerfiles and config files to build Docker or Sif containers (Example)
Pixi/Conda/Mamba: Sufficient practical experience, able to use conda to quickly deploy analysis software and workflows
Git: Experience using Git for project code/document management, understands basic creation/push/merge methods, has used hook features to run some automated tasks
DevOps: Understand basic DevOps concepts, able to use Github actions / Azure Pipeline platforms for code management, automated testing and deployment

Mastered Bioinformatics Software/Modules

emmm… I’ve used quite a few, and I haven’t used many of them in the past year… But I should be able to get reacquainted with them all within a week, hahahhahaha

Next‑generation sequencing data splitting: bcl2fastq
BED file manipulation: bedtools
Data quality control: fastp, fastQC, MultiQC
Sequence alignment: bwa, bowtie, STAR
Expression quantification/analysis: DESeq2, DEXseq, Salmon, kallisto, htseq, Hisat2
Sequence processing: seqtk, seqkit
BAM/VCF file manipulation: Picard, Samtools, Bcftools, Sambamba, Pysam, Biogo
UMI processing: UMI‑tools, fgbio

Analysis frameworks: cellranger, rhapsody, Scanpy, Seurat
Cellular composition differences: Milopy, miloR
Cell annotation: SingleR, celltypist, Azimuth
Pseudotime: Monocle, scvelo, cellrank, pymonocle
Cell‑cell communication: Cellchat

SNP/INDEL detection: Samtools, GATK(Mutect2), Vardict, Vacscan, Strelka
DNA structural variation / fusion gene detection: CREAST, lumpy‑sv, Smoove, SViCT, SvABA, Manta, Delly, GeneFuse, Factera, sv‑tools
CNV detection: CNVkit, PyLOH
Variant annotation: ANNOVAR, snpEff, VEP

Clonality: PyClone, SciClone, FastClone
Sequencing data genetic consistency checking: plink, NGSCheckMate
Mass spectrometry identification: pFind, maxquant

Other Work Skills

Patent draft writing
Software copyright writing
Research paper writing
Static blog/document writing
- hexo
- mkdocs
- gitbook

Displayable Projects

Github Projects

Published Articles

I generally only take responsibility for the bioinformatics related parts in the following articles:

Ye Feng et al., “Multi-Epitope Vaccine Design Using an Immunoinformatic Approach for SARS-CoV-2” Pathogens 10 (June 11, 2021) 737.
Zheling Chen et al., “A Neoantigen-Based Peptide Vaccine for Patients With Advanced Pancreatic Cancer Refractory to Standard Treatment” Front. Immunol. 12 (2021) 691605.
Jiawei Shou et al., “Combination Treatment of Radiofrequency Ablation and Peptide Neoantigen Vaccination: Promising Modality for Future Cancer Immunotherapy” Front. Immunol. 13 (September 29, 2022) 1000681.