Portfolio

Portrait of Guanli Liu

Guanli (Leo) Liu · Backend Systems · Data Infrastructure · Research Engineering

I build data and AI systems, then turn them into reproducible, measurable products.

Postdoctoral researcher and engineer at the University of Melbourne. This site is organized as a growing portfolio of case studies across database systems, benchmarking, and LLM-assisted data workflows.

Research & Engineering Focus

Current focus areas linked to active systems, papers, and open-source delivery.

Drift-Aware Benchmarking Systems DriftBench and AI-DB benchmark workflows for reproducible workload and data drift evaluation.
PostgreSQL Extension Intelligence Extension-oriented benchmarking and integration of HMAB, GRASP, and CoLSE in a unified AI-DB runtime.
Data Layout and Query Performance Lakehouse physical design, indexing, and cost-aware optimization for analytical workloads.
LLM-Assisted Database Workflows Natural language to executable query pipelines with retrieval, decomposition, and result-grounded checks.
Open-Source Product Delivery GitHub-first development, PyPI releases, documentation websites, and repeatable CI/CD workflows.

Featured Build Stories

Projects are presented as build stories: problem context, technical implementation, and measurable outcome.

Benchmarking AI-DB PostgreSQL Extension

AI-DB Extension Benchmark

Built a benchmark and evaluation workflow for AI-DB as a PostgreSQL extension, with recent integration of three core projects: HMAB, GRASP, and CoLSE, enabling unified testing and comparison under one extension-oriented runtime.

Stack: PostgreSQL extension workflows, Python automation, reproducible benchmark harnesses, performance analysis.

System Design Lakehouse VLDB 2026 (Submitted)

LayoutPilot / Layout Advisory

Designed an advisory backend for analytical data lake workloads, including workload ingestion, SQL parsing, and reusable decision logic for partitioning and intra-file layout choices.

Stack: Python, SQL parsing workflows, experiment harnesses, Dockerized service environment.

LayoutPilot paper figure
Agent Infrastructure Prototype

Metadata-Native Agent Workspace Infrastructure

Proposed and prototyped an infrastructure direction where metadata drives task-aware context routing, access control, and recoverable actions for agent workflows.

LLM + DB Applied AI

LLM-Assisted Query Agents

Developed agents that translate natural language requests into executable database workflows through retrieval, query decomposition, and result-grounded evaluation.

Selected Publications

This section now reads from structured data and will be connected to the Scholar sync harness in the next workflow steps.

CoLSE: A Lightweight and Robust Hybrid Learned Model for Single-Table Cardinality Estimation using Joint CDF Published

Lankadinee Rathuwadu, Guanli Liu, Christopher Leckie, Renata Borovica-Gajic. ICDE, 2026.

Benchmarking RL-Enhanced Spatial Indices Against Traditional, Advanced, and Learned Counterparts Published

Guanli Liu, Renata Borovica-Gajic, Hai Lan, Zhifeng Bao. ICDE, 2026.

Efficient Cost Modeling of Space-filling Curves Published

Guanli Liu, Lars Kulik, Christian S. Jensen, Tianyi Li, Renata Borovica-Gajic, Jianzhong Qi. PVLDB, 2025.

Efficiently Learning Spatial Indices Published

Guanli Liu, Jianzhong Qi, Christian S. Jensen, James Bailey, Lars Kulik. ICDE, 2023.

Efficient Index Learning via Model Reuse and Fine-tuning Published

Guanli Liu, Jianzhong Qi, Lars Kulik, Kazuya Soga, Renata Borovica-Gajic, Benjamin I. P. Rubinstein. ICDEW, 2023.

Effectively Learning Spatial Indices Published

Jianzhong Qi, Guanli Liu, Christian S. Jensen, Lars Kulik. PVLDB, 2020.

Build Queue (Next Portfolio Drops)

  • Blockchain Data Platform at nftDb: Kafka + Airflow ingestion and BigQuery analytics workflow case study.
  • Baidu IM Backend: message protocol and deduplication reliability engineering breakdown.
  • Evaluation Tooling: reusable experiment templates and CI-style benchmark validation pipeline.

Work Experience

2019-Present: Postdoctoral Research Fellow / PhD Researcher, The University of Melbourne (Melbourne, Australia).

Leading research and engineering projects on database benchmarking, indexing, data layout, and AI-driven query processing.

Designing system prototypes, supervising junior researchers, and co-supervising master's students on spatial indexing and database systems.

Building backend systems and reproducible evaluation pipelines for data and AI workloads.

2023-2024: Data Scientist / Data Infrastructure Engineer, nftDb (Melbourne, Australia).

Built blockchain ingestion pipelines, dbt and SQL analytics workflows, and BigQuery-based analysis.

Developed an internal RAG assistant for engineering knowledge retrieval.

2015-2017: Software Engineer, Baidu (China).

Worked on large-scale messaging systems and protocol design for internal communication platforms.

Improved message deduplication and database performance in production backend services.

Mentoring, Teaching, and Research Service

Mentoring and Supervision

  • Co-supervising master's students on research projects in spatial indexing and database systems.
  • Supervising junior researchers in reproducible benchmarking and AI-driven data systems projects.

Teaching

  • COMP90018 - Android Application Development (The University of Melbourne): Tutor (Aug. 2019 - 2023), Responsible for tutorials, student support, and assessment marking.
  • COMP90041 - Programming and Software Development (The University of Melbourne): Tutor (Aug. 2019 - 2023), Responsible for tutorials and assessment marking.

Research Service

  • Conference Reviewer: SIGMOD 2026, ICDE 2027, VLDB 2027/2026/2025, KDD 2026/2025 (Excellent Reviewer).
  • Journal Reviewer: TKDE, WWW, and Transactions on Spatial Algorithms and Systems (TSAS).