CV

Research Interests

  • Controlled Text Generation - Persona Control
  • Fraud Detection

Research Experience

2023/05/08 - now Working as a Research Assistant at The Chinese University of Hong Kong, Shenzhen

  • Reconstructed the Self-Instruct pipeline code for the team
  • Conducted the research on how LLMs can be applied to the user simulator
  • Papers: PlatoLM: Teaching LLMs via a Socratic Questioning User Simulator
    • arxiv: https://arxiv.org/abs/2308.11534v4
    • code: https://github.com/FreedomIntelligence/PlatoLM
    • model: https://huggingface.co/FreedomIntelligence/PlatoLM-7B
      • has achieved SOTA performance from 2023/08 to 2023/10 in Alpaca-Eval and MT-Bench.
    • dataset: https://huggingface.co/datasets/FreedomIntelligence/SocraticChat

04/2019 Construction of Logistics Service Quality Model for Used E-Commerce Platforms (Xianyu App)

  • Preliminarily constructed the evaluation index system in terms of SERVQUAL Model, the overall problems of logistics services of second-hand E-commerce platform, and the factors affecting quality
  • Built the final evaluation index system by Exploratory Factor Analysis (SPSS) through the first questionnaire survey about the logistics services quality of second-hand E-commerce platform
  • Carried out the second questionnaire survey on the Xianyu App to evaluate and analyze the logistics service quality by means of Fuzzy Hierarchical Comprehensive Evaluation (SPSS)
  • Awarded an outstanding thesis at the College level

Education

M.S. in Business and Data Analytics - Information System Stream

  • City University of Hong Kong, 2021
  • GPA: 3.57/4.3
  • Graduated with distinction
  • Awards: Outstanding Student Gold Award with the highest GPA(3.72/4.3) in Sem A at Major Level
  • Courses: Data Mining, Big Data & Social Media Analytics, Advanced Software Construction, Statistical Data Analysis, Database Management Systems, Applied Linear Statistical Modelling
  • Domestic Certification: Statistics

B.S. in Management Science & Engineering - Supply Chain Direction

  • Beijing Normal University, Zhuhai, 2019
  • GPA: 3.42/4
  • Major Rank: 27/225 (top 12%)
  • Awards: Excellent Graduation Thesis at College Level, Third Prize Scholarship at College Level
  • Courses: Calculus, Operations Research, Linear Algebra, Applied Statistics, Forecasting and Decision Making, Logistics Information Systems, Probability Theory and Mathematical Statistics

Skills

  • Programming languages: python, golang, Java, cpp, R, JavaScript, html, css
  • Database: mysql, redis, sql-server, mongodb
  • OS: Linux(Centos/Ubuntu)
  • Libraries/Packages: pytorch, libtorch, numpy, pandas, sklearn, flask, sqlalchemy, django, drf, gin, gorm
  • ML/DL Model: lr, xgb, lgb, catboost, resnet, lstm, gru, transformer, Bert, autoEncoder, GAN
  • Software: SPSS, SASEM, Tableau

Project Experience

01/2023 Online learning blog community construction

  • Responsible for web backend implementation
  • Used Technologies: gin + gorm + machinery + swagger + mysql + redis + vue

09/2022-12/2022 Server-side Chatbot “Big Head” v2.0

  • Responsible for model training, back-end implementation, partial front-end implementation
  • Used Technologies: ubuntu + git + pytorch + numpy + matplotlib + django + vue + mysql + redis + celery
  • Used Model: seq2seq(encoder: bert + transformer + gru, decoder: gru + transformer + gpt2)
  • Link
  • Alternate Link

05/2022-06/2022 Food Classification and Network Compression

  • Responsible for code optimization, model training (Kaggle competition)
  • Used Technologies: numpy + pytorch + PIL + tqdm + matplotlib
  • Used Model: Resnet, MobileNet
  • Rank: TOP 4% (56/1406)

06/2021-07/2021 Multiple classifications of Telstra network service outage severity

  • Responsible for code writing, model training (Kaggle competition)
  • Used Technologies: numpy + pytorch + tqdm + matplotlib + Bayesian optimization + grid search cross-validation
  • Used Model: lightgbm, catboost
  • Rank: TOP 20% (limited by computing capacity)

Project Overview

Development

  • python: Built full-stack chatbots, back-end web Q&A platform
  • golang: Built small instant messaging system, back-end blog community
  • java: Built family income and expenditure bookkeeping system, customer information management system, development team staff scheduling system

Algorithm

  • ML: regression, binary classification (rf, lr, fcn), multiclassification (xgb, lgb, catboost)
  • CV: image recognition (vgg16, resnet18-101), image generation (dcgan, wgan, wgan-gp), anomaly face detection (fcn-ae, cnn-ae, resnet18-ae, vae)
  • NLP: extractive question-and-answer prediction (bert), conversational bots (gru+luong attention), sentiment analysis
  • Speech: phoneme recognition (fcn), speaker recognition (transformer, conformer)
  • Other: transfer learning (DaNN), model compression (knowledge distillation, deep separable networks)

Database

  • mysql: designed database with query optimization

Language Skills

  • JLPT-N2
  • CET-6

Hobbies

  • Animate, Comic, Game, Novel