Pingchuan Ma

I am a fourth-year Ph.D. candidate in the Department of Computer Science and Engineering at the Hong Kong University of Science and Technology (HKUST), under the supervision of Prof. Shuai Wang. I am a visiting scholar in the Sky Lab at UC Berkeley, hosted by Prof. Alvin Cheung. I received my B.Eng. from Beijing Electronic Science and Technology Institute. My research interests include data management and software engineering.

I can be reached at pmaab at cse dot ust dot hk or pmaab at berkeley dot edu.

Publications

PP-CSA: Practical Privacy-Preserving Software Call Stack Analysis
Testing Graph Database Systems via Graph-Aware Metamorphic Relations
Evaluating C/C++ Vulnerability Detectability of Query-Based Static Application Security Testing Tools
On Extracting Specialized Code Abilities from Large Language Models: A Feasibility Study
Enabling Runtime Verification of Causal Discovery Algorithms with Automated Conditional Independence Reasoning
InsightPilot: An LLM-Empowered Automated Data Exploration System
Explain Any Concept: Segment Anything Meets Concept-Based Explanation
Causality-Aided Trade-off Analysis for Machine Learning Fairness
PerfCE: Performance Debugging on Databases with Chaos Engineering-Enhanced Causality Analysis
Towards Practical Federated Causal Structure Learning
XInsight: eXplainable Data Analysis Through The Lens of Causality
CC: Causality-Aware Coverage Criterion for Deep Neural Networks
sem2vec: Semantics-Aware Assembly Tracelet Embedding
Deceiving Deep Neural Networks-Based Binary Code Matching with Adversarial Programs
NoLeaks: Differentially Private Causal Discovery Under Functional Causal Model
ML4S: Learning Causal Skeleton from Vicinal Graphs
Unlearnable Examples: Protecting Open-Source Software from Unauthorized Neural Code Learning
NeuralD: Detecting Indistinguishability Violations of Oblivious RAM with Neural Distinguishers
Enhancing DNN-Based Binary Code Function Search With Low-Cost Equivalence Checking
Unleashing the Power of Compiler Intermediate Representation to Enhance Neural Program Embeddings
MT-Teql: Evaluating and Augmenting Neural NLIDB on Real-world Linguistic and Schema Variations
MetaInsight: Automatic Discovery of Structured Knowledge for Exploratory Data Analysis
Metamorphic Testing and Certified Mitigation of Fairness Violations in NLP Models

Preprint

Benchmarking Multi-Modal LLMs for Testing Visual Deep Learning Systems Through the Lens of Image Mutation
Eliminating Information Leakage in Hard Concept Bottleneck Models with Supervised, Hierarchical Concept Learning
An Empirical Study on Large Language Models in Accuracy and Robustness under Chinese Industrial Scenarios
VRPTEST: Evaluating Visual Referring Prompting in Large Multimodal Models
InstructTA: Instruction-Tuned Targeted Attack for Large Vision-Language Models
Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach
Split and Merge: Aligning Position Biases in Large Language Model based Evaluators
"Oops, Did I Just Say That?" Testing and Repairing Unethical Suggestions of Large Language Models with Suggest-Critique-Reflect Process

means corresponding author.

Award

  • Overseas Research Award, HKUST Fok Ying Tung Graduate School, 2024.
  • UGC Research Travel Grant, 2023-24 Academic Year.
  • Future of Life Institute Travel Support, 2024.
  • UGC Research Travel Grant, 2022-23 Academic Year.
  • SIGMOD Student Travel Award, April 2023.
  • AISTATS "Top Reviewer", February 2023.
  • Microsoft Research Asia "Star of Tomorrow" Award, December 2022.
  • Microsoft Research Asia "Star of Tomorrow" Award, March 2022.
  • NVIDIA Academic Hardware Grant, March 2022.

Experience

  • Visiting Scholar, Sky Lab, UC Berkeley, hosted by Prof. Alvin Cheung, 2024.4 - present
  • Research Intern, Microsoft Research Asia, mentored by Justin Ding, 2022.6 - 2022.11
  • Research Intern, Microsoft Research Asia, mentored by Justin Ding, 2021.6 - 2022.4
  • Research Intern, Microsoft Research Asia, mentored by Justin Ding, 2019.12 - 2020.6

Talk

  • Elevating Exploratory Data Analysis in The Era of Large Language Model, Huawei, 15 Dec, 2023.
  • Learning Causal Skeleton from Vicinal Graphs, Microsoft Research Asia, 5 Aug, 2022.
  • Towards Dependable and Transparent Data Analytics Platforms, Microsoft Research Asia, 10 Mar, 2022.
  • Automated Fairness Testing and Beyond, Microsoft Research Asia Causality Reading Group, 6 Sept, 2021.
  • Metamorphic Testing and Certified Mitigation of Fairness Violations in NLP Models (in Chinese), AI Time, 15 Jan, 2021.

Academic Service

  • Conference Program Committee/Reviewer of ECML/PKDD '24, KDD '24, SDM '24, NeurIPS Ethics Review '23, SIGMOD ARI '23, Queer in AI @ ACL '23, ECML/PKDD '23, FAccT '23, KDD '23, AISTATS '23, PETS '23 Artifact Evaluation, PETS '22 Artifact Evaluation, ISSTA '22 Artifact Evaluation, EuroSys '22 Artifact Evaluation
  • Journal Reviewer of Harvard Data Science Review, Scientific Reports, Journal of Systems & Software (JSS), International Journal of Computer Vision (IJCV), IEEE Signal Processing Letters (SPL)
  • Sub-Reviewer of ISSTA '24, FSE '24, ISSTA '23, USENIX Security '23, NeurIPS '22, PETS '22, CCS '22, ASE '22, DBTest '22, AsiaCCS '22, AsiaCCS '21, ICSE '20 Artifact Evaluation, and USENIX Security '20 Artifact Evaluation

Teaching Experience

  • Teaching Assistant, COMP4901N: Competitive Programming in Cybersecurity (Fall 2021)
  • Teaching Assistant, COMP6613C: Topics in Computer Security and Privacy (Spring 2021)