Moozonian
About 12 results
✨ Moozonian AI is reading the web for you...
github.com
github.com › ShishirPatil › gorilla
GitHub - ShishirPatil/gorilla: Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
⏱ 10 min read
www.deeplearning.ai
deeplearning.ai › sh...evaluating-ai-agents
Evaluating AI Agents - DeepLearning.AI
Feb 20, 2026 — Learn how to systematically evaluate, improve, and iterate on AI agents using structured assessments.
⏱ 4 min read
doi.org
doi.org › 10.18637%2Fjss.v011.i04
Evaluating the Normal Distribution | Journal of Statistical Software
Feb 22, 2026 —
arxiv.org
arxiv.org › abs › 2402.19450
[2402.19450] Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning Gap
Feb 21, 2026 — We propose a framework for robust evaluation of reasoning capabilities of language models, using functional variants of benchmarks. Models that solve a reasoning test should exhibit no difference in p...
⏱ 4 min read
doi.org
doi.org › 10.1023%2FA%3A1013298507114
Theory of Mind for a Humanoid Robot | Autonomous Robots | Springer Nature Link
Feb 21, 2026 — If we are to build human-like robots that can interact naturally with people, our robots must know not only about the properties of objects but also the properties of animate agents in the world. One ...
⏱ 8 min read
aka.ms
aka.ms › raft-repo
gorilla/raft at main · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
aka.ms
aka.ms › raft-repo
gorilla/raft at main · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
aka.ms
aka.ms › goex-out-docker-sandbox
gorilla/goex/exec_engine/api_executor.py at 34ae76099a21aaf01df434ebca33f83489b55096 · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
aka.ms
aka.ms › goex-out-undo-decision
gorilla/goex/cli.py at 34ae76099a21aaf01df434ebca33f83489b55096 · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
aka.ms
aka.ms › goex-out-undo-prompt
gorilla/goex/exec_engine/pipeline.py at 34ae76099a21aaf01df434ebca33f83489b55096 · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
aka.ms
aka.ms › goex-out-forward-prompt
gorilla/goex/exec_engine/pipeline.py at 34ae76099a21aaf01df434ebca33f83489b55096 · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla
aka.ms
aka.ms › goex-out-github-readme
gorilla/goex at main · ShishirPatil/gorilla · GitHub
Feb 21, 2026 — Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) - ShishirPatil/gorilla