Education
York University, Ph.D. of Software Engineering
Sep. 2020 – NowUniversity of Waterloo, Master of Software Engineering
Dec. 2017 – May. 2019
Publication
CoCoFuzzing: Testing Neural Code Models with Coverage-Guided Fuzzing
Moshi Wei, Yuchao Huang, Jinqiu Yang, Junjie Wang, and Song Wang
IEEE Transactions on Reliability (TR’22)\API Recommendation for Machine Learning Libraries: How Far Are We?
Moshi Wei, Yuchao Huang, Junjie Wang, Jiho Shin, Shiri harzevili Nima, and Song Wang
ESEC/FSE 2022\CLEAR: Contrastive Learning for API Recommendation
Moshi Wei, Shiri harzevili Nima, Yuchao Huang, Junjie Wang, and Song Wang
ICSE 2022Automatic Unit Test Generation for Machine Learning Libraries: How Far Are We?
Song Wang, Nishtha Shrestha, Abarna Kucheri Subburaman, Junjie Wang, Moshi Wei, and Nachiappan Nagappan
ICSE 2021Yet Another Combination of IR-and Neural-based Comment Generation
Yuchao Huang, Moshi Wei, Song Wang, Junjie Wang, Qing Wang
IST 2022Coconut: combining context-aware neural translation models using ensemble for program repair
Lutellier Thibaud, Pham Hung Viet, Pang Lawrence, Li Yitong, Wei Moshi, Tan Lin
ISSTA 2020Survey on Automated API recommendation
Wei, Moshi
Experience
Ph.D. Candidate, York University\
Sep. 2020 – Now
Extracted and formatted data from Stackoverflow and the official Java documentation using BS4, followed by tokenization and EDA analysis using Scikit-learn, StanfordNLP, and NLTK, and visualisation using Seaborn and Pyplot.
Created sourcecode pro-processing pipline with Python AST parser, APIs are extracted from a code snippet and tokenized with a custom tokenizer.
Trained MLP models such as Seq2seq LSTM, BERT, CodeBERT, Pegasus, Text-CNN with Huggingface, Pytorch or Tensorflow with multiple loss functions for performance optimization such as contrastive learning, prototypical network, siamese network, and compared baseline models such as SVM, NB, KNN, and XGBoost build with sklearn based on multiple metrics such as MRR, MAP, precision, recall, and F1
Ericsson – Software Engineer (Intern)\
Sep. 2021 – Jan. 2022
- Joined the research team as a research intern for prototyping a web-based low-code webpage editing platform with web components, AngularJS, and GrapeJS.
Software Engineer – BI, Achievers\
May. 2019 – Sep. 2020
- Initiated and developed an awarded collaborative filtering-based user recommendation system with Spacy, Scikit-learn, and pandas. Run the system on 200 users for testing and feedback
- Developed and maintained daily ETL tasks with Centerprise ETL and PostgreSQL
- Speedup TB-level ETL data warehouse restoration time by 24 times by restructuring workflow using PostgreSQL and python