Large scale embeddings on a single machine
Marius is a system under active development for training embeddings for large-scale graphs on a single machine. Training on large scale graphs requires a large amount of data movement to get embedding parameters from storage to the computational device.Marius is designed to mitigate/reduce data movement overheads using: Pipelined training and IO Partition caching and buffer-aware data orderings Details on how Marius works can be found in our OSDI ’21 Paper, where experiment scripts and configurations can be found in the […]
Read more