Reinforceent Learmmg Agent in Python Icon

News

reinforcement-learning-from-human-feedback

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL) ...

Mistral launches API for building AI agents that run Python, generate images, perform RAG and more

For professionals like the Lead AI Engineer or Senior AI Engineer, the Mistral Agents API represents a powerful addition to ...

s3: The new RAG framework that trains search agents with minimal data

S3 decouples RAG search from generation, boosting efficiency and generalization for enterprise LLM applications with minimal data.

LangChain Sandbox Run Untrusted Python Safely for AI Agents

Discover how LangChain Sandbox ensures safe Python code execution for AI developers, protecting systems from unverified code ...

WinBuzzer2d

Mistral’s Agents API Gets Toolkit for Advanced AI Agents with MCP Support

Mistral AI launches its new Agents API, offering developers advanced tools like code execution, RAG, and MCP support for building sophisticated AI agents, aligning with OpenAI and Anthropic.

GitHub2d

multiagent-reinforcement-learning

Clean, documented implementations of PPO-based algorithms for cooperative multi-agent reinforcement learning, focusing on SMAC environments. Features MLP and RNN-based MAPPO with various normalization ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results