Open
Labels: type:support (Further information is requested)
Description
It seems that, to guarantee atomicity, a recursive copy of the checkpoint is created on every new entry:
https://github.com/google/orbax/blob/main/checkpoint/orbax/checkpoint/_src/path/snapshot/snapshot.py#L72
On regular NFS or S3 storage, which does not support copy-on-write, this leads to O(N^2) traffic, because a full copy of the existing checkpoint is made on each update. Is there a way to avoid that, even at the cost of disabling atomicity? I don't really need atomicity in my case, but I still want to use partial writes.
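To make the quadratic growth concrete, here is a minimal sketch of the copy-then-append pattern as I understand it. This is not Orbax code: `simulate_partial_writes`, `dir_size`, and the `entry_{i}.bin` file names are hypothetical, and the snapshot step is approximated with a plain `shutil.copytree` on a local temp directory, assuming no copy-on-write.

```python
import shutil
import tempfile
from pathlib import Path


def dir_size(path: Path) -> int:
    """Total size in bytes of all regular files under `path`."""
    return sum(p.stat().st_size for p in path.rglob("*") if p.is_file())


def simulate_partial_writes(num_entries: int, entry_bytes: int) -> tuple[int, int]:
    """Hypothetical model of snapshot-per-update partial writes.

    Returns (bytes_copied_by_snapshots, bytes_of_new_data_written).
    """
    copied = 0
    written = 0
    with tempfile.TemporaryDirectory() as root:
        checkpoint = Path(root) / "checkpoint"
        checkpoint.mkdir()
        for i in range(num_entries):
            # Assumed behavior: a full recursive copy of the existing checkpoint
            # is taken before each new entry is added.
            snapshot = Path(root) / "snapshot.tmp"
            shutil.copytree(checkpoint, snapshot)
            copied += dir_size(snapshot)

            # Add the new entry to the snapshot.
            (snapshot / f"entry_{i}.bin").write_bytes(b"\0" * entry_bytes)
            written += entry_bytes

            # Swap the snapshot in as the new checkpoint.
            shutil.rmtree(checkpoint)
            snapshot.rename(checkpoint)
    return copied, written


if __name__ == "__main__":
    copied, written = simulate_partial_writes(num_entries=10, entry_bytes=100)
    print(f"copied={copied} bytes, new data={written} bytes")
```

With N equally sized entries of s bytes each, the copies alone move s * N * (N - 1) / 2 bytes, while the new data is only s * N bytes. On storage without copy-on-write, all of the copied bytes are real traffic.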