Skip to content
View avamingli's full-sized avatar

Block or report avamingli

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
avamingli/README.md

Hi, I'm Zhang Mingli

Database Kernel Developer:

  • Postgres Recognized Contributor - View Recognition
     Features, Bug fix and patch review.
  • PostgreSQL ACE(China PostgreSQL Association).
  • Greenplum Team (2019-2022, Pivotal/VMware)
     Greenplum Committer & Main author of Greenplum Streaming Server (GPSS).
  • Apache Cloudberry Major Contributor & PPMC
     Founding member of Cloudberry Database, which has contributed to the development of Apache Cloudberry.
     Contributed to almost every component of the system, such as planner, exetutor, storage, distributed transaction, etc.
     Main maintainer of Apache Cloudberry. A significant portion of the development work was completed before the project was
     open-sourced.
     Principal/ Sole author of many significant features: Parallel Query, AQUMV(Answer Query Using Materialized Views),
     Dynamic Tables and etc. Critical bug fixes and much more.

Open Source Contribution:

Postgres:

Critical commits:

Greenplum:

Some of critical fixes:

Apache Cloudberry

Answer Query Using Materialized Views(AQUMV).

Sole Author, automatically answer query using the results of Materialized Views, Dynamic Tables and Incremental Materialized views in planner.

Parallel Query

Principal Author, Design and implement the architecture of Parallel Query in Greenplum/Apache Cloudberry based on Postgres’ parallel codes.

Contributions before open source:
  • New locus: HashWorkers, SegmentGeneralWorkers.
  • Locus compatible for parallel join including: Parallel Hash/Nestloop/Merge Join, Parallel-aware Hash Join. Parallel inner, left, anti, semi join.
  • Append Only (AO) table’s Parallel SeqScan.
  • Append Only Column Orientation(AOCS) table’s Parallel SeqScan.
  • Parallel Create Table AS of AppendOnly table storage.
  • Parallel Refresh Materialized Views of AO/AOCO storage.
  • Explain(locus): show locus info of each plan node.
  • Insert into multiple segfiles for AO/AOCS table.
Open source contributions:
  • Parallel DEDUP_SEMI and DEDUP_SEMI_REVERSE Join.
  • Make UNION Parallel.
  • Parallel DISTINCT plan of multi-stage.
  • Parallel-oblivious Hash Left Anti Semi (Not-In) Join
  • Implement Parallel-aware Hash Left Anti Semi (Not-In) Join
  • Fix wrong results of Left Anti Semi (Not-In) Join
  • Add motionhazard to the outer side of parallel aware join.(fix flaky incorrect results of agg)
  • Refactor cdbpath_motion_for_parallel_join() by outer join inner style
  • Open proper AO/AOCS segment files according to data volume
  • Fix AO/AOCS insertDesc memory issue
  • Fix segfilecount of AO/AOCO when bulk insertion: COPY

Other Features:

  • [Feature] Dynamic Table.
  • Let Replicated locus join with others(Results of writeable CTE on replicated table Join with others)
  • Enable SingleQE join with SegmentGeneralWorkers
  • Implement 3-phase aggregation with DEDUP HashAgg for DISTINCT.
  • Optimize DISTINCT, ORDER BY and DISTINCT ON when Aggregation without Group By.

A Journey of Passion: My Story (Chinese)

Tech Talk (Chinese)

---_画板 1

Pinned Loading

  1. cloudberrydb cloudberrydb Public

    Forked from apache/cloudberry

    Zhang Mingli's CBDB develop repo

    C

  2. gpdb gpdb Public

    Forked from greenplum-db/gpdb-archive

    Greenplum Database

    C 1 2