LLM accuracy test for coding in a specific field, but each model are only tested with up to 4 model sizes.
Prompt (all models):
implement a whole chess engine in python with python-chess
Outputs (reasoning discarded) for DeepSeek:
To implement a chess engine using Python and the Python chess library, follow these steps:
- Install Dependencies