Implementation of Confidential Assets

The implementation of confidential assets (CA) on the Beam blockchain takes advantage of the Lelantus-MW protocol, which enhances the privacy and security of all transactions.

- CA support
  - Blinded asset tags, similar to the Elements design by A. Poelstra, can optionally be associated with each Unspent Transaction Output (UTXO).
  - Asset tags have a proof of validity based on the 1-out-of-many Sigma protocol developed by Jens Groth.
- Shielded pool (a.k.a. Lelantus-MW)
  - CA support for shielded operations.
  - One-side payments and direct anonymous payments support.
- The system design is heterogeneous in nature:
  - All kernels carry excess blinding factors and may include extra validation rules such as height lock and relative lock.
  - Some kernels may control subsystems:
    - Asset control (creation, emission).
    - Shielded operations (mint, spend).
  - Kernels not only affect the balance but may also have side effects.

Confidential assets support

The current design differs significantly from the previous one.

Each asset is identified by an AssetID, which is a 32-bit integer. Each asset has a corresponding NUMS generator, which is generated deterministically from the AssetID (via hashing). AssetID == 0 is reserved for the default asset (Beam).
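A minimal sketch of deriving a generator from an AssetID by hashing, using the try-and-increment method. The curve (secp256k1 is used here only as an example), the SHA-256 hash, the domain tag `TAG`, the counter encoding and the function name `asset_generator` are all assumptions for illustration; the text above only states that the generator is derived deterministically via hashing, and the actual derivation may differ.

```python
import hashlib

# Example curve parameters (secp256k1): y^2 = x^3 + 7 over the field F_P
P = 2**256 - 2**32 - 977
B = 7
TAG = b"example-asset-generator"  # hypothetical domain-separation tag


def asset_generator(asset_id: int) -> tuple:
    """Derive a curve point (x, y) from a 32-bit AssetID by try-and-increment."""
    if not 1 <= asset_id < 2**32:
        # Assumption: AssetID 0 (Beam) uses the default generator, not a derived one.
        raise ValueError("asset_id must be a non-zero 32-bit integer")
    counter = 0
    while True:
        data = TAG + asset_id.to_bytes(4, "big") + counter.to_bytes(4, "big")
        x = int.from_bytes(hashlib.sha256(data).digest(), "big")
        if x < P:
            rhs = (pow(x, 3, P) + B) % P
            # P % 4 == 3, so a square root (if one exists) is rhs^((P + 1) / 4)
            y = pow(rhs, (P + 1) // 4, P)
            if y * y % P == rhs:
                # Normalise to the even y so the result is unique
                return (x, y if y % 2 == 0 else P - y)
        counter += 1
```

Because the point is the output of a hash, nobody is expected to know its discrete logarithm with respect to G or to any other asset generator, which is the property a NUMS generator needs.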

UTXO encoding

In Mimblewimble (MW), different asset types can be encoded in UTXOs by using different NUMS (nothing-up-my-sleeve) generators. A UTXO that carries an asset comprises the following:

- Blinded generator: $$H^* = H_i + k_A•G$$
- Asset surjection proof: verifies that the provided generator is indeed one of the listed generators, with an arbitrary blinding factor added.
- Pedersen commitment: $$C = k•G + v•H^*$$
- Rangeproof (Bulletproof), expressed in terms of this blinded generator.
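
Substituting the blinded generator into the commitment shows why the usual MW balance check still works. This is only a direct expansion of the two formulas above, not an additional protocol step:

$$C = k•G + v•(H_i + k_A•G) = (k + v•k_A)•G + v•H_i$$

So C is an ordinary Pedersen commitment to v under the asset's own generator $$H_i$$, with an effective blinding factor of $$k + v•k_A$$. Commitments to the same asset therefore still add up in the $$H_i$$ component, even when every UTXO uses a different $$k_A$$.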

Asset surjection proof

The proof is based on the Sigma protocol: the prover specifies a range of AssetID values and proves that the provided generator is one of the generators within that range, with an arbitrary blinding factor added.

The verifier generates the list of asset generators for the specified range and subtracts the provided blinded generator from each element of the list. The prover then uses the Sigma protocol to prove knowledge of the opening of one of the resulting elements in terms of the G generator only, i.e. the blinding factor.
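
The list construction can be illustrated with a small symbolic model. This is a sketch of the algebra only, under loud assumptions: `Point` is a hypothetical stand-in that stores generator coefficients in the clear (so it hides nothing), `N` is a toy modulus, and the Sigma proof itself is not implemented. In the real protocol nobody can read the coefficients; the prover instead proves knowledge of the opening without revealing which element it belongs to.

```python
N = 2**61 - 1  # toy scalar modulus (assumption); real code uses the curve's group order


class Point:
    """Formal sum of independent generators, e.g. {"G": 5, "H3": 1}."""

    def __init__(self, coeffs):
        # Drop zero coefficients so equal points compare equal
        self.coeffs = {g: k % N for g, k in coeffs.items() if k % N}

    def __add__(self, other):
        total = dict(self.coeffs)
        for g, k in other.coeffs.items():
            total[g] = total.get(g, 0) + k
        return Point(total)

    def __sub__(self, other):
        return self + (N - 1) * other  # (N - 1) acts as -1 modulo N

    def __rmul__(self, scalar):
        return Point({g: scalar * k for g, k in self.coeffs.items()})

    def __eq__(self, other):
        return self.coeffs == other.coeffs


G = Point({"G": 1})


def asset_gen(asset_id):
    """Symbolic generator H_i of one asset."""
    return Point({f"H{asset_id}": 1})


def blind(asset_id, k_a):
    """Prover side: H* = H_i + k_A*G."""
    return asset_gen(asset_id) + k_a * G


def verifier_list(blinded, first_id, last_id):
    """Verifier side: H_i - H* for every AssetID in the declared range."""
    return [asset_gen(i) - blinded for i in range(first_id, last_id + 1)]


blinded = blind(asset_id=5, k_a=1234)
candidates = verifier_list(blinded, first_id=1, last_id=8)
# Only the true asset's entry loses its H component and becomes a pure multiple of G
pure_g = [i for i, c in enumerate(candidates, start=1) if set(c.coeffs) <= {"G"}]
print(pure_g)  # [5]
```

For the true asset the list entry equals $$-k_A•G$$, which is why knowing $$k_A$$ is enough to open it in terms of G alone.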

Asset control

Any user may create new asset types. Unlike Beams, which are released into circulation automatically with each new block, assets are emitted and burned by the asset owner. The asset lifecycle has three stages: asset creation, asset emission/burn, and asset destroying.

Asset creation

To create an asset, the user sends an asset-creating transaction that provides both the owner key and the associated metadata. Any subsequent asset action must be signed with the corresponding private key. The metadata is visible to all users and is immutable once the asset is created.

Creating an asset requires locking a significant number of Beams, meaning that this transaction implicitly consumes that amount. If the transaction is successful, the system assigns the lowest available AssetID to the asset and links it to the asset info (metadata and owner key).
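
The "lowest available AssetID" rule can be sketched as follows. The function name and the representation of active assets as a set of integers are assumptions made for the example.

```python
def lowest_free_asset_id(active_ids):
    """Return the smallest AssetID >= 1 that is not in use (0 is reserved for Beam)."""
    candidate = 1
    while candidate in active_ids:
        candidate += 1
    if candidate >= 2**32:
        raise OverflowError("no free 32-bit AssetID left")
    return candidate


print(lowest_free_asset_id({1, 2, 4}))  # 3
print(lowest_free_asset_id({1, 3, 4}))  # 2
```

The second call shows that an ID freed by a destroyed asset is handed out again, so the same AssetID can refer to different assets over time.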

Asset emission/burn

The user sends a transaction with a special asset emission kernel, which carries an emission amount that can be positive (emission) or negative (burn). In addition to the usual signature for the kernel excess blinding factor, the kernel must be signed with the private key of the asset owner.

The kernel implicitly creates or consumes the stated amount of the asset, which must be balanced by the other transaction elements, such as inputs and outputs.

Asset destroying

To destroy an asset, an asset destroying kernel with owner key signature is required. Once the asset has been destroyed, the AssetID is no longer linked to the owner, and the locked Beams get returned to the user.

An asset can be destroyed only if:

- It has been totally burned, i.e. the entire emitted amount has been burned.
- A minimum lock period has elapsed since the asset was completely burned, with no emission during that period.

The minimum lock period prevents tampering by the asset owner. For example, a user who receives a specific asset must be able to rely on the AssetID staying linked to the same asset info (metadata and owner key) until the transaction expires. The lock period prevents the asset owner from destroying and re-creating the asset during that time.

Asset state

The system state contains a commitment to the most recent asset state. It is an MMR (Merkle Mountain Range) root of all the currently active assets together with their info, which includes:

- Static info: metadata and owner public key
- Current emission value
- Lock height: the height of the most recent burned/not-burned transition of the asset. Based on this:
  - The asset owner knows if and when it can destroy the asset and get the locked funds back.
  - Other users can estimate the minimum height range in which the asset can be used safely, i.e. whether it could disappear because of a past reorg or be tampered with in the future.
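
The per-asset state and the destroy rule described earlier can be sketched together. This is an illustration under assumptions: the field and function names are hypothetical, `MIN_LOCK_PERIOD` is a made-up value, a freshly created asset is assumed to start in the burned state with its lock height set to the creation height, and owner signature checks are left out.

```python
from dataclasses import dataclass

MIN_LOCK_PERIOD = 1440  # hypothetical value, in blocks


@dataclass
class AssetInfo:
    owner_pubkey: bytes   # static info
    metadata: bytes       # static info, immutable after creation
    emission: int = 0     # current emission value
    lock_height: int = 0  # height of the latest burned/not-burned transition


def apply_emission(asset, amount, height):
    """Apply a positive (emit) or negative (burn) amount at the given height."""
    new_value = asset.emission + amount
    if new_value < 0:
        raise ValueError("cannot burn more than was emitted")
    # Record the height only when the asset crosses between burned and not-burned
    if (asset.emission == 0) != (new_value == 0):
        asset.lock_height = height
    asset.emission = new_value


def can_destroy(asset, height):
    """True if the asset is fully burned and has stayed so for the lock period."""
    return asset.emission == 0 and height - asset.lock_height >= MIN_LOCK_PERIOD


asset = AssetInfo(b"owner-key", b"metadata", lock_height=100)
apply_emission(asset, 50, height=200)   # not-burned from height 200
apply_emission(asset, -20, height=300)  # partial burn, no transition
apply_emission(asset, -30, height=400)  # fully burned from height 400
print(can_destroy(asset, 400 + MIN_LOCK_PERIOD - 1))  # False
print(can_destroy(asset, 400 + MIN_LOCK_PERIOD))      # True
```

Any new emission during the lock period moves the lock height forward again, so the waiting time restarts after the next complete burn.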

Locking funds for asset creation is needed to prevent spamming of the system. An excess of assets would not only make nodes heavier, it would also make the asset surjection proof less effective (since its anonymity set is limited).

Despite the need to lock considerable funds, this design should be acceptable for users who just want to experiment with assets, since they are supposed to get the funds back once they finish. For organizations that sell their asset to users, it is not obvious that they will ever be able to burn the asset completely (for this they first need to own the entire emitted amount again). But this seems to be a justified risk.

Note on transaction repeatability

One of the problems specific to UTXO-based systems in general, and MW in particular, is repeatability. If an attacker controls all the inputs of a specific transaction in which it pays someone (which is usually the case), it can repeat this transaction later regardless of the will of the other user. Moreover, if that user later spends only the inputs received from the attacker, then the attacker can repeat those transactions too, and so on. Everything can be 'replayed' up to some depth, at which more inputs are needed that were not originally received from the attacker.

If only Beams are traded, this is not a big problem: because of those 'replays' users can only get paid, not lose their funds. But replaying asset-controlling transactions IS a problem. Because assets are essentially created from 'thin air', by replaying some transactions the attacker may cause extra asset emission (which is already a big problem), and may even be able to obtain some amount of this asset.

To mitigate this threat, duplicate kernels will be forbidden starting from Fork2. Technically this is achieved by the following:

- There will be a new consensus parameter, MaxKernelLifespan, probably equivalent to ~1 month.
- Starting from Fork2, kernels with HeightLock.Min (minimum height) lower than Fork2 will be rejected.
- Each kernel, in addition to the optional HeightLock.Max (maximum height), will have an implicit maximum height lock of HeightLock.Min + MaxKernelLifespan. This (and the previous restriction) will make repeating old kernels impossible.
- Each node will have to keep track of all the recent kernels, down to the current height minus MaxKernelLifespan. Kernels below this height may be forgotten (for the sake of blockchain verification).
- Side effect: kernels with a relative height lock (already available on the mainnet) will not be able to reference a kernel older than MaxKernelLifespan. This is acceptable, since in practice relative locks are needed for much shorter durations.

This way kernel replaying becomes illegal, while nodes only have to keep track of the most recent kernels.
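
The rules above can be sketched as a small validity check. Everything concrete here is an assumption for illustration: the `Kernel` fields, the function names, the `FORK2_HEIGHT` value, and a `MAX_KERNEL_LIFESPAN` of 43,200 blocks (about 30 days if blocks arrive once per minute).

```python
from dataclasses import dataclass
from typing import Optional

FORK2_HEIGHT = 500_000        # hypothetical activation height
MAX_KERNEL_LIFESPAN = 43_200  # hypothetical: ~30 days of one-minute blocks


@dataclass(frozen=True)
class Kernel:
    kernel_id: bytes
    min_height: int                   # HeightLock.Min
    max_height: Optional[int] = None  # optional HeightLock.Max


def effective_max_height(kernel):
    """Explicit maximum height, capped by the implicit min + lifespan limit."""
    implicit = kernel.min_height + MAX_KERNEL_LIFESPAN
    if kernel.max_height is None:
        return implicit
    return min(kernel.max_height, implicit)


def is_acceptable(kernel, height, recent):
    """Check a kernel for inclusion in a post-Fork2 block at `height`.

    `recent` maps kernel_id -> inclusion height for recently seen kernels.
    """
    if kernel.min_height < FORK2_HEIGHT:
        return False  # pre-fork kernels are rejected
    if not kernel.min_height <= height <= effective_max_height(kernel):
        return False  # outside the explicit or implicit height lock
    return kernel.kernel_id not in recent  # duplicates are forbidden


def prune(recent, height):
    """Forget kernels that were included more than one lifespan ago."""
    cutoff = height - MAX_KERNEL_LIFESPAN
    return {kid: h for kid, h in recent.items() if h >= cutoff}
```

Pruning is safe in this model because a kernel included at height h has a minimum height of at most h, so its implicit maximum height is at most h + MaxKernelLifespan. Once the chain is past that height the kernel fails the height check on its own, and the duplicate lookup is no longer needed.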

Shielded pool (a.k.a. Lelantus-MW)

Disclaimer: The Lelantus Protocol is the work of Zcoin's cryptographer Aram Jivanyan as part of its research to improve its privacy protocol. Our design and implementation are based on the publicly-available Lelantus scientific paper. All our code was developed from scratch based on this paper alone.

In order to solve the MW linkability problem, users will be able to recycle their funds via the shielded pool. Our design differs from the original Lelantus protocol in the following ways:

- Transaction values are never revealed.
- Instead of transactions, it is formulated in terms of mint/spend primitives; the final transactions are composed of MW and shielded parts in any combination, keeping the balance-to-zero principle (MW-style).
- CA are naturally supported.

Technically, in addition to the standard transaction elements, the following are supported:

- Shielded output: transfers some amount from MW into the shielded pool
- Shielded input: withdraws some amount from the shielded pool back into MW

Both elements are encoded as special transaction kernels.

In addition to the standard MW blinding factor generator G, there's an additional generator J for the secondary blinding factor, a.k.a. serial number.

Shielded output

Consists of the following:

- Blinded serial number commitment: $$C_s = k_s•G + s•J$$
- Generalized Schnorr's signature that proves the above commitment is indeed of this form
- Optionally asset info: the blinded asset generator + asset surjection proof
- UTXO commitment: $$C_{mw} = k_{mw}•G + v•H$$
- Rangeproof

In order to verify the overall transaction balance, only the UTXO commitment $$C_{mw}$$ (without the serial number) is accounted for. After verification, instead of going to the UTXO set, the following double-blinded commitment goes into the shielded pool:

$$C = C_s + C_{mw} = s•J + (k_s + k_{mw})•G + v•H$$

The shielded outputs in the pool form a sequence of commitments (EC points).

The serial number s is derived from another public key, the SpendKey, which will need to be revealed during spending. In addition, the prover will need to prove knowledge of the corresponding private key.

The $$C_s$$ commitment must also be unique. This prevents accidental misuse that would make a later withdrawal impossible: since a SpendKey must be unique when spending, two pool elements that share a serial number could not both be withdrawn.

Shielded input

Consists of the following:

- Range within the shielded pool that contains the element being spent.
- SpendKey, which is revealed; the whole shielded input is signed by the corresponding private key.
- Optionally asset info: the blinded asset generator + asset surjection proof.
- Output commitment: $$C_{out} = k_{out}•G + v•H$$
  - It should commit to the same value, but the blinding factor $$k_{out}$$ is different from the $$k_{mw}$$ used in the UTXO commitment of the shielded output.
- Generalized Schnorr's signature that proves $$C_{out}$$ is indeed of this form.
- Sigma proof for the rest.

The SpendKey must be unique; this is how double-spending is prevented.

During the verification, the verifier computes the serial number s from the SpendKey. Then the following is calculated:

$$C = C_{out} + s•J$$

This EC point is subtracted from each element in the referenced range of the shielded pool.

If everything is correct, then the element being spent turns into:

$$C = (k_s + k_{mw} - k_{out})•G$$

Both the asset and serial number generators H and J are eliminated. The prover then proves knowledge of the opening of one of the elements in the range in terms of the G generator only.
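
The cancellation can be checked with a small symbolic model. This is a sketch of the algebra only, under loud assumptions: `Point` is a hypothetical stand-in that keeps generator coefficients in the clear (so it hides nothing), `N` is a toy modulus, deriving the serial number with SHA-256 is an assumption, and signatures, rangeproofs and the Sigma proof are omitted. The names `k_mw` and `k_out` are labels chosen here for the blinding factors of the shielded output's UTXO commitment and of the shielded input's output commitment.

```python
import hashlib

N = 2**61 - 1  # toy scalar modulus (assumption); real code uses the curve's group order


class Point:
    """Formal sum of independent generators, e.g. {"G": 5, "H": 100, "J": 7}."""

    def __init__(self, coeffs):
        # Drop zero coefficients so equal points compare equal
        self.coeffs = {g: k % N for g, k in coeffs.items() if k % N}

    def __add__(self, other):
        total = dict(self.coeffs)
        for g, k in other.coeffs.items():
            total[g] = total.get(g, 0) + k
        return Point(total)

    def __sub__(self, other):
        return self + (N - 1) * other  # (N - 1) acts as -1 modulo N

    def __rmul__(self, scalar):
        return Point({g: scalar * k for g, k in self.coeffs.items()})

    def __eq__(self, other):
        return self.coeffs == other.coeffs


G, H, J = Point({"G": 1}), Point({"H": 1}), Point({"J": 1})


def serial(spend_key: bytes) -> int:
    """Derive the serial number s from the SpendKey (hash choice is an assumption)."""
    return int.from_bytes(hashlib.sha256(spend_key).digest(), "big") % N


def shielded_output(k_s, k_mw, v, spend_key):
    """Pool element C = C_s + C_mw = s*J + (k_s + k_mw)*G + v*H."""
    c_s = k_s * G + serial(spend_key) * J
    c_mw = k_mw * G + v * H
    return c_s + c_mw


def reduce_range(pool_range, c_out, spend_key):
    """Verifier side: subtract C_out + s*J from every element in the range."""
    target = c_out + serial(spend_key) * J
    return [c - target for c in pool_range]


pool = [
    shielded_output(11, 12, 100, b"key-a"),
    shielded_output(21, 22, 250, b"key-b"),
    shielded_output(31, 32, 250, b"key-c"),
]
# Spend the second element: same value, fresh blinding factor k_out = 7
c_out = 7 * G + 250 * H
reduced = reduce_range(pool, c_out, b"key-b")
print([set(c.coeffs) <= {"G"} for c in reduced])  # [False, True, False]
```

The third element commits to the same value as the spent one, so its H component cancels as well, but its J component does not, because it was created with a different serial number. Only the element that matches in both value and serial number reduces to a pure multiple of G, here `(21 + 22 - 7)•G`.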

One-side payments, and direct anonymous payments

In addition to solving the linkability problem, the shielded pool allows one-side payments (normally, MW transactions are built mutually). This is possible because the serial number is derived from an arbitrary public key, the SpendKey, which, after the initial setup, may be calculated by the sender alone, without knowledge of the corresponding private key (and hence without the ability to spend the output).

This already provides one-side payments. However, it is not completely anonymous: since the sender knows the SpendKey, it can see when the receiver spends the output. This can be solved too, because the shielded output consists of two parts: $$C_s$$ and the UTXO commitment $$C_{mw}$$. During the initial setup the receiver generates an arbitrary number of different $$C_s$$ elements (with their Schnorr's signatures) and sends them to the sender. The sender then uses them as-is in the shielded output, without knowledge of the serial number.

We incorporated a scheme by which the receiver detects all of its shielded outputs by scanning the blockchain (i.e. no auxiliary channel is needed to notify the receiver). For $$C_s$$, all the owner info is embedded within the Schnorr's signature (which has a degree of freedom). For $$C_{mw}$$, all the needed info is recovered from the bulletproof.

At the end the following information is recovered:

- All the relevant parameters: blinding factor, SpendKey, value, AssetID
- Whether the element is visible to the sender, i.e. whether $$C_s$$ was created by the sender or by the receiver in advance
- Sender ID (a public key belonging to the sender)
- Arbitrary 32-byte message

This info can be obtained with the so-called owner key, but spending still requires the master key. This makes it possible to use the owner key in owned nodes to detect owned TXOs and shielded elements without the risk of losing the funds if the node is compromised.

Implications and constraints

Lelantus is a great technology, but it comes at a price.

- Scalability (size)
  - Obviously, no cut-through for the shielded inputs/outputs
    - Shielded output: ~800 bytes
    - Shielded input: ~1.6 KB, depending on the anonymity set size
    - If the asset type is blinded: 2 more asset proofs (for output and input), another ~2 KB
  - Cut-through is still applied on the MW part
- Verification time
  - ~1 s for 64K elements (very big)
  - Easily parallelized
  - Only 10 ms for each additional proof for the same anonymity set (batch verification)
  - During initial sync many blocks can be batch-verified at once as well
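
As a rough back-of-the-envelope model of the batch-verification figures above (about 1 s for the first proof over a 64K-element set and about 10 ms for each further proof over the same set), consider the following sketch. The function name and the linear cost model are assumptions; real timings depend on hardware and on how much the anonymity sets overlap.

```python
def batch_verify_seconds(n_proofs, first_s=1.0, extra_s=0.010):
    """Rough time to verify n proofs that share one anonymity set."""
    if n_proofs <= 0:
        return 0.0
    return first_s + (n_proofs - 1) * extra_s


# 20 shielded inputs over the same set, batched vs. verified one by one
batched = batch_verify_seconds(20)
unbatched = 20 * batch_verify_seconds(1)
```

Under this model 20 proofs over the same set cost about 1.19 s when batched, compared with about 20 s when each is verified separately, which is why a large overlap between shielded inputs matters.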

So, in order to build a sane system, which enjoys the benefits of MW, but helps break the linkability, we design it this way:

- Most transactions should remain in MW.
- The maximum number of shielded inputs/outputs in a block is limited. Users will have to compete for them (fee market).
- The spend window (anonymity set size) is limited, and is dramatically decreased if the element being spent is not one of the most recent.

The maximum spend window (anonymity set size) will probably be ~50K - 100K (not decided yet). The maximum number of shielded elements in a block will be tuned such that it takes at least several days to fill this window.

Another important restriction: users will be able to spend their shielded element with the maximum spend window only if it references the most recent elements. It won't be possible to specify a large spend window that covers a range older than twice this window size.

In simple words, users will have a time window to spend their element "nicely". If they miss this opportunity, they'll have to spend it in a dramatically smaller spend window (~1K elements), but they will then be able to recycle it through the shielded pool again.
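
One possible reading of this rule, as a sketch: a range may use the large window only if it starts within the most recent 2 x maximum-window elements of the pool, and otherwise falls back to the small window. The constants, the function name and this exact interpretation of "older than twice this window size" are assumptions; the text above leaves the final numbers undecided.

```python
MAX_WINDOW = 65_536   # hypothetical; the text says ~50K - 100K, not decided yet
SMALL_WINDOW = 1_024  # hypothetical; the text says ~1K elements


def allowed_window(pool_size, range_start):
    """Largest spend window permitted for a range starting at index range_start."""
    if not 0 <= range_start < pool_size:
        raise ValueError("range_start must index an existing pool element")
    oldest_start_for_large = pool_size - 2 * MAX_WINDOW
    if range_start >= oldest_start_for_large:
        return MAX_WINDOW
    return SMALL_WINDOW


print(allowed_window(1_000_000, 900_000))  # 65536
print(allowed_window(1_000_000, 0))        # 1024
```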

This way we expect to keep good scalability and performance:

- Not too many elements that can't be cut-through
- Reasonable verification times: shielded inputs will have large overlap

But importantly those restrictions will also lead to better privacy. Here's why.

Privacy

To understand what privacy is achieved by hiding in a crowd, let's first define absolute and relative anonymity sets.

- The absolute anonymity set size is the net size of the set chosen by the user.
- The relative anonymity set size is the ratio of the chosen absolute set size to the weighted overall set in which the user could potentially hide, with appropriate probabilities.

Simply speaking, the relative set size is the probability that a user chooses a specific absolute set.

To achieve high privacy both the absolute and the relative sets should be maximized.

- Obviously, if the absolute set size is small, then the user is already a suspect.
- If the relative set size is small, then the user can be deanonymized by a number of recurring transactions, even if the absolute anonymity set is big! A good explanation of this was given by Ian Miers.

Because the anonymity set size in Lelantus is finite, we need a compromise.

- If too few users use it, then every user is already a suspect.
- If too many users use it, then the window is filled within a shorter time period, which means a smaller relative set (a smaller probability that an unrelated user falls into the same set).

Systems with an unlimited anonymity set size (like Zcash) have an advantage here. However, practically speaking, the difference may not be that big. Although users can theoretically spend any element, in practice they probably spend their recent outputs anyway (because of the nature of the usage). So the information leaked in Lelantus is considerable, but it could be assumed by the attacker with significant probability anyway.

Probably real-world usage data is needed to estimate the practical privacy of the system.