Medical QA Datasets Collection A collection of medical question answering (QA) datasets β’ 20 items β’ Updated 6 days ago β’ 25
view post Post 2552 Reply Here is my latest study on OpenAIπo1π. A Case Study of Web App Coding with OpenAI Reasoning Models (2409.13773)I wrote an easy-to-read blogpost to explain finding.https://huggingface.co/blog/onekq/daily-software-engineering-work-reasoning-modelsINSTRUCTION FOLLOWING is the key.100% instruction following + Reasoning = new SOTABut if the model misses or misunderstands one instruction, it can perform far worse than non-reasoning models. π§ 9 9 π 2 2 π₯ 1 1 +