Internet-Draft hpke-mlkem October 2024
Connolly Expires 22 April 2025 [Page]
Workgroup:
Crypto Forum
Internet-Draft:
draft-connolly-cfrg-hpke-mlkem-04
Published:
Intended Status:
Informational
Expires:
Author:
D. Connolly
SandboxAQ

ML-KEM for HPKE

Abstract

This document defines Module-Lattice-Based Key-Encapsulation Mechanism (ML-KEM) KEM options for Hybrid Public-Key Encryption (HPKE). ML-KEM is believed to be secure even against adversaries who possess a cryptographically-relevant quantum computer.

About This Document

This note is to be removed before publishing as an RFC.

The latest revision of this draft can be found at https://dconnolly.github.io/draft-connolly-cfrg-hpke-mlkem/draft-connolly-cfrg-hpke-mlkem.html. Status information for this document may be found at https://datatracker.ietf.org/doc/draft-connolly-cfrg-hpke-mlkem/.

Discussion of this document takes place on the Crypto Forum Research Group mailing list (mailto:cfrg@ietf.org), which is archived at https://mailarchive.ietf.org/arch/search/?email_list=cfrg. Subscribe at https://www.ietf.org/mailman/listinfo/cfrg/.

Source for this draft and an issue tracker can be found at https://github.com/dconnolly/draft-connolly-cfrg-hpke-mlkem.

Status of This Memo

This Internet-Draft is submitted in full conformance with the provisions of BCP 78 and BCP 79.

Internet-Drafts are working documents of the Internet Engineering Task Force (IETF). Note that other groups may also distribute working documents as Internet-Drafts. The list of current Internet-Drafts is at https://datatracker.ietf.org/drafts/current/.

Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress."

This Internet-Draft will expire on 22 April 2025.

Table of Contents

1. Introduction

1.1. Motivation

ML-KEM [FIPS203] is a Key-Encapsulation Mechanism (KEM) which is believed to be secure against both classical and quantum cryptographic attacks. For parties that must move to exclusively post-quantum algorithms, this document defines pure post-quantum algorithms for the Hybrid Public-Key Encryption (HPKE) protocol [RFC9180]. ML-KEM as a post-quantum IND-CCA2-secure KEM fits nicely into HPKE's design. Supporting multiple security levels for ML-KEM allows a spectrum of use cases including settings where the (United States) National Institute of Standards (NIST) security category 5 is required.

1.2. Not an authenticated KEM

ML-KEM is a plain KEM that does not support the static-static key exchange that allows HPKE based on Diffie-Hellman (DH) based KEMs and their (optional) authenticated modes.

2. Conventions and Definitions

The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "NOT RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in BCP 14 [RFC2119] [RFC8174] when, and only when, they appear in all capitals, as shown here.

The following terms are used throughout this document to describe the operations, roles, and behaviors of HPKE:

GenerateKeyPair, DeriveKeyPair, SerializePublicKey, DeserializePublicKey, Encap, Decap, AuthEncap, AuthDecap, Nsecret, Nenc, Npk, and Nsk are defined in Section 4 of [RFC9180].

When used in the Security Consideration section, PK refers to public key and CT refers to ciphertext as modeled in [CDM23].

TODO: explain or reference IND-CCA, IND-CCA2, MAL-BIND-K-PK, MAL-BIND-K-CT, and LEAK-BIND-K-PK.

3. Usage

[FIPS203] supports two different key formats - this document only supports the 64-byte seed (d, z). This format supports stronger binding properties for ML-KEM than the expanded format. The 64-byte seed format protects against re-encapsulation attacks. This format provides properties closer to the generic DHKEM binding properties defined in Section 4.1 of [RFC9180].

The encapsulation and decapsulation keys are computed, serialized, and deserialized as described in [FIPS203] where the decapsulation keys MUST be in the 64-byte (d, z) format. The 'expanded' format where the decapsulation key is expanded into a variable size based on the parameter set but includes the hash of the encapsulation key is not used.

3.1. Key generation

ML-KEM satisfies the HPKE KEM function GenerateKeyPair(), the randomized algorithm to generate a key pair, via Algorithm 19 ML-KEM.KeyGen() in [FIPS203]. To be explicit, we use only the seed format (d, z) generated by lines 1 and 2 of Algorithm 19 ML-KEM.KeyGen() of [FIPS203] and stored securely as described in section 7.1 of [FIPS203].

def GenerateKeyPair():
    d = random(32) # `random(n)` MUST comply with {{FIPS203}}'s RBG requirements
    z = random(32) # `random(n)` MUST comply with {{FIPS203}}'s RBG requirements
    (ek, _) = ML-KEM.KeyGen_internal(d, z)
    return (concat(d, z), ek)

3.2. Key derivation

ML-KEM satisfies the HPKE KEM function DeriveKeyPair(ikm), the deterministic algorithm to derive a key pair from the byte string ikm, where ikm SHOULD have at least Nsk bytes, via Algorithm 16 ML-KEM.KeyGen_internal(d, z) in [FIPS203].

The input ikm is the 64-byte decapsulation key (d, z), described as the seed in section 7.1 in [FIPS203]. The 64 bytes of ikm MUST be generated according to section 7.1, Algorithm 19, of [FIPS203], that is by freshly sourcing 32 random bytes for d and then freshly sourcing another 32 random bytes for z from a FIPS-approved RBG.

The RBG MUST have a security strength of at least 128 bits for ML-KEM-512, at least 192 bits for ML-KEM-768, and at least 256 bits for ML-KEM-1024.

3.3. Public key serialization

The HPKE KEM function SerializePublicKey() is the identity function, since the ML-KEM already uses fixed-length byte strings for public encapsulation keys per parameter set.

3.4. Public key deserialization

The HPKE KEM function DeserializePublicKey() is the identity function, since the ML-KEM already uses fixed-length byte strings for public encapsulation keys per parameter set.

3.5. Encapsulation

ML-KEM satisfies the HPKE KEM function Encap(pkR) via Algorithm 20, ML-KEM.Encaps(ek), of [FIPS203], where an ML-KEM encapsulation key check failure causes an HPKE EncapError.

3.6. Decapsulation

ML-KEM satisfies the HPKE KEM function Decap(enc, skR) via Algorithm 21, ML-KEM.Decaps(dk, c), of [FIPS203], where an ML-KEM ciphertext check failure or decapsulation key check failure or hash check failure cause an HPKE DecapError.

To be explicit, we derive the expanded decapsulation key from the 64-byte seed format and invoke ML-KEM.Decaps(dk) with it:

def Decap(enc, skR):
    (sk, _) = DeriveKeyPair(skR) # expand decapsulation key from 64-byte format
    return ML-KEM.Decaps(sk, enc)

3.7. AuthEncap and AuthDecap

HPKE-ML-KEM is not an authenticated KEM and does not support AuthEncap() nor AuthDecap(), see Section 1.2.

4. Security Considerations

HPKE's IND-CCA2 security relies upon the IND-CCA and IND-CCA2 security of the underlying KEM and AEAD schemes, respectively. ML-KEM is believed to be IND-CCA secure via multiple analyses.

The HPKE key schedule is independent of the encapsulated KEM shared secret ciphertext and public key of the ciphersuite KEM, and dependent on the shared secret produced by the KEM. If HPKE had committed to the encapsulated shared secret ciphertext and public key, we wouldn't have to worry about the binding properties of the ciphersuite KEM's X-BIND-K-CT and X-BIND-K-PK properties. These computational binding properties for KEMs were formalized in [CDM23]. [CDM23] describes DHKEM as MAL-BIND-K-PK and MAL-BIND-K-CT secure as a result of the inclusion of the serialized DH public keys (the KEM's PK and CT) in the DHKEM Key Derivation Function (KDF). MAL-BIND-K-PK and MAL-BIND-K-CT security ensures that the shared secret K 'binds' (is uniquely determined by) the encapsulation key and/or the ciphertext, even when the adversary is able to create or modify the key pairs or has access to honestly-generated leaked key material.

ML-KEM as specified in [FIPS203] with the seed key format provides MAL-BIND-K-CT security and LEAK-BIND-K-PK security [KEMMY24]. LEAK-BIND-K-PK security is resiliant where the involved key pairs are output by the honest key generation algorithm of the KEM and then leaked to the adversary. MAL-BIND-K-CT security strongly binds the shared secret and the ciphertext even when an adversary can manipulate key material like the decapsulation key.

ML-KEM using the seed key format (providing MAL-BIND-K-CT and LEAK-BIND-K-PK) nearly matches the binding properties of DHKEM (the default HPKE KEM construction). The ML-KEM ciphertext is strongly bound by the shared secret. The encapsulation key is more weakly bound, and protocols integrating HPKE using ML-KEM even with the seed key format should evaluate whether they need to strongly bind to the PK elsewhere (outside of ML-KEM or HPKE) to be resilient against a MAL adversary, or to achieve other tight binding to the encapsulation key PK to achieve properties like implicit authentication or session independence.

5. IANA Considerations

This document requests/registers two new entries to the "HPKE KEM Identifiers" registry.

Value:

0x0040 (please)

KEM:

ML-KEM-512

Nsecret:

32

Nenc:

768

Npk:

800

Nsk:

64

Auth:

no

Reference:

This document

Value:

0x0041 (please)

KEM:

ML-KEM-768

Nsecret:

32

Nenc:

1088

Npk:

1184

Nsk:

64

Auth:

no

Reference:

This document

Value:

0x0042 (please)

KEM:

ML-KEM-1024

Nsecret:

32

Nenc:

1568

Npk:

1568

Nsk:

64

Auth:

no

Reference:

This document

6. References

6.1. Normative References

[FIPS203]
"Module-Lattice-Based Key-Encapsulation Mechanism Standard", National Institute of Standards and Technology, DOI 10.6028/nist.fips.203, , <https://doi.org/10.6028/nist.fips.203>.
[RFC2119]
Bradner, S., "Key words for use in RFCs to Indicate Requirement Levels", BCP 14, RFC 2119, DOI 10.17487/RFC2119, , <https://www.rfc-editor.org/rfc/rfc2119>.
[RFC8174]
Leiba, B., "Ambiguity of Uppercase vs Lowercase in RFC 2119 Key Words", BCP 14, RFC 8174, DOI 10.17487/RFC8174, , <https://www.rfc-editor.org/rfc/rfc8174>.
[RFC9180]
Barnes, R., Bhargavan, K., Lipp, B., and C. Wood, "Hybrid Public Key Encryption", RFC 9180, DOI 10.17487/RFC9180, , <https://www.rfc-editor.org/rfc/rfc9180>.

6.2. Informative References

[CDM23]
Cremers, C., Dax, A., and N. Medinger, "Keeping Up with the KEMs: Stronger Security Notions for KEMs and automated analysis of KEM-based protocols", , <https://eprint.iacr.org/2023/1933.pdf>.
[KEMMY24]
Schmieg, S., "Unbindable Kemmy Schmidt: ML-KEM is neither MAL-BIND-K-CT nor MAL-BIND-K-PK", , <https://eprint.iacr.org/2024/523.pdf>.

Appendix A. Acknowledgments

The authors would like to thank Cas Cremers for their input.

Appendix B. Change log

TODO

Appendix C. Test Vectors

This section contains test vectors formatted similary to that which are found in [RFC9180], with two changes. First, we only provide vectors for the non-authenticated modes of operation. Secondly, as ML-KEM encapsulation does not involve an ephemeral keypair, we omit the ikmE, skEm, pkEm entries and provide an ier entry instead. The value of ier is the randomness used to encapsulate, so ier[0:32] is the seed that is fed to H in the first step of ML-KEM encapsulation in [FIPS203].

Author's Address

Deirdre Connolly
SandboxAQ