Paimon
Apache Paimon innovatively combines lake format and LSM structure, bringing efficient updates into the lake architecture. To integrate Fluss with Paimon, you must enable lakehouse storage and configure Paimon as lakehouse storage. See more detail about Enable Lakehouse Storage.
Introduction
When a table with option 'table.datalake.enabled' = 'true' is created or altered in Fluss, Fluss will create a corresponding Paimon table with same table path as well.
The schema of the Paimon table is as same as the schema of the Fluss table, except for there are two extra columns __offset and __timestamp appended to the last.
These two columns are used to help Fluss client to consume the data in Paimon in streaming way like seek by offset/timestamp, etc.
Then datalake tiering service compacts the data from Fluss to Paimon continuously. For primary key table, it will also generate change log in Paimon format which enables you streaming consume it in Paimon way.