Nothing happened :), the half-precision kernel is not implemented for CPUs. 565d0b3 AAOBA committed on Dec 2, 2023
Using FP16 for inference to try to avoid the weird, stupidly long inference time. b182a23 AAOBA committed on Dec 2, 2023
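
Read together, the two commits above suggest that FP16 inference is only worth requesting when a GPU is available, since the half-precision kernel was reported as not implemented for CPUs. A minimal sketch of that device-dependent choice follows. It assumes PyTorch-style device strings and dtype names; `pick_dtype_name` is a hypothetical helper for illustration and is not taken from this repository.

```python
def pick_dtype_name(device: str) -> str:
    """Return the dtype name to use for inference on the given device.

    Hypothetical helper for illustration; not taken from the repository.
    """
    # Device strings such as "cuda" or "cuda:0" indicate a GPU,
    # where half precision is assumed to be supported.
    if device.split(":")[0] == "cuda":
        return "float16"
    # On CPU (or any other device), stay in full precision.
    return "float32"
```

If PyTorch is installed, the returned name could be resolved with `getattr(torch, pick_dtype_name(device))` and passed wherever the model's dtype is set. Whether this actually shortens inference time depends on the hardware and the model.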