An inference system designed for handling 10-100 billion parameter transformer models efficiently.
Jan 1, 2022