(Expand to View)
| Paper | Comments |
|---|---|
| 100 instances is all you need: predicting the success of a new LLM on unseen data by testing on a few instances | - |
| xLAM: A Family of Large Action Models to Empower AI Agent Systems | - |
| Planning In Natural Language Improves LLM Search For Code Generation | - |
| Game On: Towards Language Models as RL Experimenters | - |
| Report Cards: Qualitative Evaluation of Language Models Using Natural Language Summaries | - |
| ATTENTION HEADS OF LARGE LANGUAGE MODELS: A SURVEY | - |
If you are intereted in the work published by us, please navigate to our full paper list.