This is an old revision of the document!
Table of Contents
Using codecs (DRAFT)
What is a codec
A codec, introducted in Java Edition 1.16, is a specification of conversion between any type of object (such as LootTable
, Advancement
or BlockPos
) and any type serialized from (nbt, json, etc.). A codec is a combination of encoder and decoder. An encoder encodes an object to serialized form, and a decoder decodes the serialized from into objects.
For example, loot tables are written in json forms in data packs, and are loaded as LootTable
objects in the server. To load loot tables from the jsons in the data pack, the codec is used. There are many pre-written codecs in Minecraft, each of which, is used for a specific type of object, but can serialize or deserialize between it and different types of serialized from. For example, LootTable.CODEC
can convert LootTable
objects into jsons and nbts, and can also convert jsons or nbts into LootTable
objects.
Codec was introduced in 1.16, but has been increasingly widely used in Minecraft since 1.20. For example, LootTable
, Advancements
and Text
, previously serialized or deserialized in other manners in older versions, but now using codecs. Item components, introduced in 1.20.5, also use codecs to serialize.
How to use a codec
When you serialize or deserialize, you need to specify a DynamicOps
, which specifies each concrete action in the serialization or deserialization. Mose common used are JsonOps.INSTANCE
and NbtOps.INSTANCE
. The following code takes an example of converting between BlockPos
and NbtElement
.
// serializing a BlockPos final BlockPos blockPos = new BlockPos(1, 2, 3); final DataResult<NbtElement> encodeResult = BlockPos.CODEC.encodeStart(NbtOps.INSTANCE, blockPos); // deserializing a BlockPos final NbtList nbtList = new NbtList(); nbtList.add(NbtInt.of(1)); nbtList.add(NbtInt.of(2)); nbtList.add(NbtInt.of(3)); final DataResult<Pair<BlockPos, NbtElement>> decodeResult = BlockPos.CODEC.decode(NbtOps.INSTANCE, nbtList);
As seen in the code, the result is not directly NbtElement
or BlockPos
, but a DataResult
wrapping them. That's because errors are common in serialization, and instead of exceptions, errors in the data results often happens. You can fetch the result with result()
(which may be empty when error happens), or fecth the error message with error()
(which may be empty when error does not happen). You can also directly fetch the result with getOrThrow()
(or Util.getResult
in older versions).
// get the result when succeed final NbtList nbtList = new NbtList(); nbtList.add(NbtInt.of(1)); nbtList.add(NbtInt.of(2)); nbtList.add(NbtInt.of(3)); final DataResult<Pair<BlockPos, NbtElement>> result1 = BlockPos.CODEC.decode(NbtOps.INSTANCE, nbtList); System.out.println(result1.getOrThrow().getFirst()); // get the result when error final NbtString nbtString = NbtString.of("can't decode me"); final DataResult<Pair<BlockPos, NbtElement>> result2 = BlockPos.CODEC.decode(NbtOps.INSTANCE, nbtString); System.out.println(result2.error().get().message());
There is a special type of DynamicOps
, named RegistryOps
, which is usually created with RegistryWrapper.getOps
. It is a wrapper of other ops, which means, when encoding or decoding, it behaves basically identical to the wrapped ops, but have some differences: when encoding or decoding some registry entries, it will use the specified RegistryWrapper
to get objects or entries, while other ops may directly use the registry or even throw an error.
How to write a codec
Mapping existing codec
For simple objects, you can directly convert it between those have existing codecs. Mojang provides codecs for all primitive types and common types. In this example, you want to create a codec for Identifier
. We just know that it can be converted from and to a String
, then we map through xmap
:
Codec<Identifier> identifierCodec = Codec.STRING.xmap(s -> new Identifier(s), identifier -> identifier.toString());
When decoding, the serialized from is converted into string through Codec.STRING
, and then converted into Identifier
through the first lambda. When encoding, the Identifier
is converted into string through the second lambda, and then encoded through Codec.STRING
.
If the serialized from is something that cannot be converted into a string, a codec result may be DataResult.Error
, and handled properly as expected. However, if you pass a string that cannot be an Identifier, such as NbtString.of(“ABC”)
, decoding it will directly throw InvalidIdentifierException
. Therefore, to handle errors properly, we use flatXmap
:
Codec<Identifier> identifierCodec = Codec.STRING.flatXmap(s -> { try { return DataResult.success(new Identifier(s)); } catch (InvalidIdentifierException e) { return DataResult.error(() -> "The identifier is invalid:" + e.getMessage()); } }, identifier -> DataResult.success(identifier.toString()));
Note that this time we use flatXmap
instead of xmap
, in which, the two lambdas returns DataResult
. This is used then two lambdas may return failed results. In this case, the second lambda does not fail, so we can also use comapFlatMap
:
Codec<Identifier> identifierCodec = Codec.STRING.comapFlatMap(s -> { try { return DataResult.success(new Identifier(s)); } catch (InvalidIdentifierException e) { return DataResult.error(() -> "The identifier is invalid:" + e.getMessage()); } }, identifier -> identifier.toString());
Record codec
Required fields
Most objects are complicated, which cannot simply represented in primitive type. It may be in a form of a map-like object with multiple fields, such as NbtCompound
or JsonObject
. In this case, you need to specify the fields, including name and codecs of the fields. For example, we create a record type:
public record Student(String name, int id, Vec3d pos) { }
To create a codec for this record, we should specify the fields codec, specify how to get fields from a Student
object, and specify how to create a Student
object from the fields:
public static final Codec<Student> CODEC = RecordCodecBuilder.create(i -> i.group( Codec.STRING.fieldOf("name").forGetter(Student::name), Codec.INT.fieldOf("id").forGetter(Student::id), Vec3d.CODEC.fieldOf("pos").forGetter(Student::pos) ).apply(i, Student::new));
In this example, the method group
takes three fields, specifying each field of the record. The parameter of fieldOf
is the name of the field, which represents as keys of the encodec result. The method forGetter
specifies how to get the value of the field from the object, such as student → student.name()
(in lambda form) or Student::name
(in method reference form). The second parameter of apply
specified how to convert the field values to a complete object, such as (string, i, vec3d) → new Student(string, i, vec3d)
.
Let's demonstrate the effect of the codec:
new Student(“Steve”, 20, Vec3d.ZERO)
will be encoded as{name: Steve, id: 20, pos: [0, 0, 0]}
.new Student(“Alex”, 25, new Vec3d(1d, 2d, 3d))
will be encoded as{name: Alex, id: 25, pos: [1d, 2d, 3d]}
.{name: Steve, id: 20, pos: [0, 0, 0]}
will be decoded asnew Student(“Steve”, 20, Vec3d.ZERO)
.{name: Steve, id: 30}
cannot be decoded, because it misses a required fieldpos
.
Optional fields
Sometimes all fields are not required. In the preview field, if the encoded result misses some fields, error will be thrown. If you want to make fields optional, use optionalFieldOf
. You also need to specify a default value:
public static final Codec<Student> CODEC = RecordCodecBuilder.create(i -> i.group( Codec.STRING.fieldOf("name").forGetter(Student::name), Codec.INT.optionalFieldOf("id", 0).forGetter(Student::id), Vec3d.CODEC.fieldOf("pos").forGetter(Student::pos) ).apply(i, Student::new));
In this case, when decoding, when the field id
does not exist, the default value 0
will be taken. When encoding, when the field equals to default value 0
, it will be missing in the result. For example:
new Student(“Steve”, 0, Vec3d.ZERO)
will be encoded as{name: Steve, pos: [0, 0, 0]}
.{name: Steve, pos: [0, 0, 0]}
will be decoded asnew Student(“Steve”, 0, Vec3d.ZERO)
.- Of course,
{name: Steve, id: 0, pos: [0, 0, 0]}
will be also decoded asnew Student(“Steve”, 0, Vec3d.ZERO)
. {name: Steve, id: Hello, pos: [0, 0, 0]}
cannot be decoded, because the fieldid
is invalid.
Pay attention to the last example. When decoding, when an optional field has an invalid value, error will be thrown. Howevevr, in older Minecraft versions, when an optional field has an invalid value, the default value will be directly taken.
- In older Minecraft versions,
{name: Steve, id: Hello, pos: [0, 0, 0]}
will be encoded asnew Student(“Steve”, 0, Vec3d.ZERO)
.
In current versions, you can also replace optionalFieldOf
with lenientOptionalFieldOf
, so as to take default values when the value is invalid.
If you do not provide a default value for optionalFieldOf
or lenientOptionalFieldOf
, it will be a field codec for Optional<T>
. For example, if the name
of a Student
is optional:
public record Student(Optional<String> name, int id, Vec3d pos) { public static final Codec<Student> CODEC = RecordCodecBuilder.create(i -> i.group( Codec.STRING.optionalFieldOf("name").forGetter(Student::name), Codec.INT.fieldOf("id").forGetter(Student::id), Vec3d.CODEC.fieldOf("pos").forGetter(Student::pos) ).apply(i, Student::new)); }
Another from of field codec
The codec can also be written like this:
public record Student(String name, int id, Vec3d pos) { public static final Codec<Student> CODEC = RecordCodecBuilder.create(i -> i.apply3( Student::new, Codec.STRING.fieldOf("name").forGetter(Student::name), Codec.INT.fieldOf("id").forGetter(Student::id), Vec3d.CODEC.fieldOf("pos").forGetter(Student::pos) )); }
Writing like this may be simpler. You should specify the method to convert fields to an objects at first (Student::new
in the example, and then write field codecs following it, with allowing IDEs to properly provide suggestions. Remember the method name (apply3
in the example) should contain the number of fields. For example, sometimes you may also use apply4
, apply5
, etc. If your record codec contains only one field, use ap
.
Dispatching codec
Some objects may have different types. In the serialized form, it may need a “type”, and according to the “type”, specify a codec to deserialize or serialize.