About Me
The About Me page has been rewrited! Now it uses tabs to collapse the content… to save space from being wasted too often~
Personal Profile
- Self-proclaimed: Silen Wang / Sylens Wong, is a rosted egg that is gradually becoming happier!
- Interests: Playing games, playing games, and still damn playing! Games! Also watch anime,
but recently I’m less and less interested in series…, In 2024, I started chasing series again! ダンジョン飯 is really amazing! Go watch it! - Learning/Working Path: Public Health -> Molecular Epidemiology -> Bioinformatics -> Neoantigens -> Organoids -> All kinds of things
- emmmm… becoming more and more eclectic
- The word cloud below can’t represent me anymore, updating soon…

- GitHub:
Education
- School: Guangdong Pharmaceutical University (formerly Guangdong Pharmaceutical College)
- College: School of Public Health
- Major: Preventive Medicine
- Degree: Bachelor
- School: Southern Medical University
- College: School of Public Health
- Major: Epidemiology and Health Statistics
- Degree: Master
Work Experience
- Employment: Bioinformatics Analysis Engineer at a bioinformatics service company in Tianjin
- Work Content:
- Human genomic next-generation sequencing data (WES / WGS) quality control and analysis
- Variant result screening based on frequency, gene region, pathogenicity software prediction, and family inheritance patterns
- Localization of potentially harmful variants based on genome-wide association study (GWAS) results
- Customization of screening plans and presentation of results according to customer requirements
- Research on the current status of specific diseases
- Research and learn the basic knowledge and current research status of specific diseases to prepare for the development of related analysis modules
- Maintenance and upgrade of data delivery processes
- Human genomic next-generation sequencing data (WES / WGS) quality control and analysis
- Employment: Bioinformatics Engineer at a genetic technology company in Hangzhou
- Work Content:
- Data Processing
- NGS data processing of cfDNA/FFPE/white blood cell samples
- Construction and testing of NGS analysis processes
- Development of tools for NGS analysis (Python / R / Julia / Shell) and quality control of cell sample sequencing data
- Alignment and quality control of raw sequencing data (fastq)
- Processing of UMI data / Molecular Barcode data
- Analysis Process Related
- Debugging of existing bioinformatics analysis processes
- Writing new analysis modules and integrating them into the existing pipeline
- Testing and debugging of analysis modules
- Deployment and dependency management of analysis software
- Performance testing of analysis software and comparison of analysis results
- Writing and maintaining process documentation
- Construction and testing of single-cell sequencing analysis processes
- Algorithm and Tool Development:
- Comparison and replication of UMI deduplication algorithms
- Development of tools for drawing quality control plots
- Development of variant result screening tools based on ML
- Performance optimization of existing tools (parallelization)
- Data Processing
- 2019.07-2020.03 Bioinformatics Engineer at a biotechnology company in Hangzhou
- Research on NGS-based cfDNA analysis plans, construction of analysis workflows
- Statistical analysis of neoantigen-related data
- Maintenance of bioinformatics analysis programs
- Maintenance of the medication guidance report database
- 2020.03-2024.04 Bioinformatics Supervisor at a biotechnology company in Hangzhou
- Development of NGS-based HLA typing/quantification modules
- Backend development of bioinformatics analysis tools based on flask
- Development of neoantigen screening modules
- Maintenance and upgrade of the main bioinformatics analysis workflow
- Writing and revising materials as required
- 2024.05-Present Senior Bioinformatics Analysis Engineer at a medical technology company in Hangzhou
- Development of WES/RNA-Seq/single‑cell analysis workflows
- Research, interpretation, formulation and execution of bioinformatics analysis plans according to customer requirements
- Construction, operation and maintenance of bioinformatics databases
- Operation and maintenance of bioinformatics servers
- Operation and maintenance of websites and system cloud servers
- Development, operation, maintenance, migration and automated testing of websites and internal laboratory management systems
- Operation and maintenance of WeChat mini‑programs
Work-related Skills
Commonly Used Languages & Tools
Programming Languages
Data Science / Database
Web
IDE / Dev
DevOps
Visualization
Bioinformatics
Operating Systems
Programming Skills
- Parallelization: Multiprocess
- Next‑generation sequencing file handling: Pysam, PyVCF, CyVCF, Biopython, Scanpy
- Web scraping:
- scrapy: Several small‑scale scraping project experiences, capable of using scrapy for static and non‑encrypted dynamic page scraping and data extraction
- Able to use splash, selenium with scrapy to scrape dynamic page data
- Data processing / statistical computing: Pandas, numpy, math, scipy, statsmodels
- Automated testing: Pytest
- Plotting: Plotly, altair
- Interactive data display: Dash
- Image processing: pillow
- WebUI/API development: FastAPI / Taipy / flask / gradio / nicegui
- Office automation: docxtpl, openpyxl, python‑docx
- Data processing: base, dplyr
- Parallelization: Parallel
- Plotting: ggplot2, ggpubr, ggthemes
- Web development: Shiny, plumber
- Linux system installation/configuration management: Red‑hat family distribution operation/maintenance, Debian family distribution installation/configuration/maintenance, Arch family distribution installation/configuration/maintenance
- HPC cluster management system: SUN Grid Engine use and maintenance experience
- Snakemake: Proficient in writing Snakefiles, proficient in using Snakemake for workflow construction and management, rich practical Snakemake experience
- Nextflow: Able to understand nextflow configuration files, experience in converting nextflow workflows to Snakemake workflows
- Container technology:
- Containerized development experience, able to configure development containers compliant with Devcontainer specifications for local or remote development
- Rich container usage and deployment experience, able to use mainstream container software such as Docker, Podman, Apptainer (formerly Singularity), able to write Dockerfiles and config files to build Docker or Sif containers (Example)
- Pixi/Conda/Mamba: Sufficient practical experience, able to use conda to quickly deploy analysis software and workflows
- Git: Experience using Git for project code/document management, understands basic creation/push/merge methods, has used hook features to run some automated tasks
- DevOps: Understand basic DevOps concepts, able to use Github actions / Azure Pipeline platforms for code management, automated testing and deployment
Mastered Bioinformatics Software/Modules
- emmm… I’ve used quite a few, and I haven’t used many of them in the past year… But I should be able to get reacquainted with them all within a week, hahahhahaha
- Next‑generation sequencing data splitting: bcl2fastq
- BED file manipulation: bedtools
- Data quality control: fastp, fastQC, MultiQC
- Sequence alignment: bwa, bowtie, STAR
- Expression quantification/analysis: DESeq2, DEXseq, Salmon, kallisto, htseq, Hisat2
- Sequence processing: seqtk, seqkit
- BAM/VCF file manipulation: Picard, Samtools, Bcftools, Sambamba, Pysam, Biogo
- UMI processing: UMI‑tools, fgbio
- Analysis frameworks: cellranger, rhapsody, Scanpy, Seurat
- Cellular composition differences: Milopy, miloR
- Cell annotation: SingleR, celltypist, Azimuth
- Pseudotime: Monocle, scvelo, cellrank, pymonocle
- Cell‑cell communication: Cellchat
- SNP/INDEL detection: Samtools, GATK(Mutect2), Vardict, Vacscan, Strelka
- DNA structural variation / fusion gene detection: CREAST, lumpy‑sv, Smoove, SViCT, SvABA, Manta, Delly, GeneFuse, Factera, sv‑tools
- CNV detection: CNVkit, PyLOH
- Variant annotation: ANNOVAR, snpEff, VEP
- Clonality: PyClone, SciClone, FastClone
- Sequencing data genetic consistency checking: plink, NGSCheckMate
- Mass spectrometry identification: pFind, maxquant
Other Work Skills
- Patent draft writing
- Software copyright writing
- Research paper writing
- Static blog/document writing
- hexo
- mkdocs
- gitbook
Displayable Projects
Github Projects
Published Articles
I generally only take responsibility for the bioinformatics related parts in the following articles:
- Ye Feng et al., “Multi-Epitope Vaccine Design Using an Immunoinformatic Approach for SARS-CoV-2” Pathogens 10 (June 11, 2021) 737.
- Zheling Chen et al., “A Neoantigen-Based Peptide Vaccine for Patients With Advanced Pancreatic Cancer Refractory to Standard Treatment” Front. Immunol. 12 (2021) 691605.
- Jiawei Shou et al., “Combination Treatment of Radiofrequency Ablation and Peptide Neoantigen Vaccination: Promising Modality for Future Cancer Immunotherapy” Front. Immunol. 13 (September 29, 2022) 1000681.