Sites

Trends

Topics

Nodes

Search for keywords, #hashtags, $sites, add a dash to exclude, e.g. -$theonion.com

From github.io

Limit of RLVR

7 8 new

Reasoning LLMs Are Just Efficient Samplers: RL Training Elicits No Transcending Capacity

#rl #llm #ppo #aime #grpo #qwen #rlvr #github #ngated #academia

on Mon, 7AM

From simonwillison.net

AI assisted search-based research actually works now

4 22

For the past two and a half years the feature I’ve most wanted from LLMs is the ability to take on search-based research tasks on my behalf. We saw the …

#ai #o3 #llms #gemini #google #o4mini #openai #search #aisearch #chatbots

on Mon, 2PM

From techspot.com

Open source AI is the new Linux, only faster

3 3

When DeepSeek launched its advanced model on Hugging Face, it marked a turning point for AI and the global open-source movement, says Asay. DeepSeek's release sparked a...

#ai #llms

19h ago

From giskard.ai

RAG Evaluation Toolkit on a Banking Supervisory Process Agent - Giskard Documentation

1 1

Learn more about Giskard RAG Evaluation Toolkit on a Banking Supervisory Process Agent | The Testing platform for AI models.

#rag #llms #aitesting

1h ago

From winbuzzer.com

Study: AI-Powered Research Prowess Now Outstrips Human Experts, Raising Bioweapon Risks - WinBuzzer

1 1

Experts have been surpassed by AI in virology lab troubleshooting according to a new study, leading to dual-use warnings and responses from AI labs like OpenAI.

#ai #vct #llms #dualuse #science #aiethics #aisafety #research #virology #biohazard

2h ago

From github.io

Readings shared April 22, 2025

1 1

The readings shared in Bluesky on 22 April 2025 are The inverse method is a good fit for Datalog theorem proving. ~ Philip Zucker. #Datalog #Logic In between myth and reality: AI for math (a case stu

#llms #math #logic #python #datalog #haskell #commonlisp #leanprover #isabellehol

8h ago

From bsky.app

Rob Sica (@robsica.bsky.social)

1 1

"It would be more appropriate to say that [LLMs] ‘confabulate’. In humans, confabulation involves the unintentional – and usually linguistic – fabrication of content: making stuff up without realising that one is doing so. It is primarily about *doing*, rather than...

#ai #llms

9h ago

From substack.com

The Growing Medium for API Ecosystems

1 1

Why shared state matters more than ever

#llms #genai #api360 #mikeamundsen #microservices

19h ago

From winbuzzer.com

Experts Challenge Validity and Ethics of Crowdsourced AI Benchmarks Like LMArena (Chatbot Arena) - WinBuzzer

1 1

Following its incorporation, LMArena (Chatbot Arena) has encountered expert critiques concerning the validity and ethics of its widely cited AI model leaderboard.

#ai #llms #genai #lmarena #aiethics #aimodels #aibenchmarks #aievaluation #chatbotarena #crowdsourcing

20h ago

From github.io

EM-LLM: Human-inspired Episodic Memory for Infinite Context LLMs

1 1

A novel approach integrating human-like episodic memory into Large Language Models for enhanced long-context processing

#rag #llms #emllm

21h ago

From winbuzzer.com

OpenAI Adopts Rival Anthropic’s MCP Standard, Joining Industry Push for AI Interoperability - WinBuzzer

1 1

OpenAI has announced support for Anthropic's Model Context Protocol (MCP), joining Microsoft, AWS, and Google in adopting the open standard for AI agents.

#ai #api #mcp #sdk #llms #openai #aiagents #devtools #anthropic #opensource

23h ago

From github.com

GitHub - x1xhlol/system-prompts-and-models-of-ai-tools

1 4

Contribute to x1xhlol/system-prompts-and-models-of-ai-tools development by creating an account on GitHub.

#llm #leak #llms #coding #openai #windsurf #promptinjection

on Mar 14

From github.com

courses/prompt_engineering_interactive_tutorial at master · anthropics/courses

1 1

Anthropic's educational courses. Contribute to anthropics/courses development by creating an account on GitHub.

#ai #llms #claude #chatbots #anthropic #generativeai #promptengineering

on Aug 29