A tool for monitoring and managing computing resources across multiple hosts
TensorHive

TensorHive is an open source tool for monitoring and managing computing resources across multiple hosts. It addresses the most common problems of accessing and sharing AI-oriented infrastructure among multiple, often competing users. It is designed with simplicity, flexibility and configuration-friendliness in mind.

Main features:

GPU Reservation calendar
Each column represents all reservation events for a GPU on a given day. To make a new reservation, simply click and drag with your mouse, select GPU(s), add some […]