Using FP16 for inference, trying to avoid weird, stupidly long inference time. b182a23 AAOBA commited on Dec 2, 2023