
#reinforcementlearning


Happy birthday to Cognitive Design for Artificial Minds (lnkd.in/gZtzwDn3), released 4 years ago!

Since then, its ideas have been presented and discussed widely across the research fields of AI, Cognitive Science and Robotics, and nowadays both the possibilities and the limitations of #LLMs, #GenerativeAI and #ReinforcementLearning (already envisioned and discussed in the book) have become a common topic of research interest in the AI community and beyond.
Similarly, the evaluation of current AI systems in human-like and human-level terms has become a critical theme, related to the problem of the anthropomorphic interpretation of AI output (see e.g. lnkd.in/dVi9Qf_k).
Book reviews have been published in ACM Computing Reviews (2021): lnkd.in/dWQpJdkV and in Argumenta (2023): lnkd.in/derH3VKN

I have been invited to present the content of the book at over 20 official scientific events, including international conferences and Ph.D. schools, in the US, China, Japan, Finland, Germany, Sweden, France, Brazil, Poland, Austria and, of course, Italy.

Some news I am happy to share: Routledge/Taylor & Francis contacted me a few weeks ago about a second edition! Stay tuned!

The #book is available in many webstores:
- Routledge: lnkd.in/dPrC26p
- Taylor & Francis: lnkd.in/dprVF2w
- Amazon: lnkd.in/dC8rEzPi

@academicchatter @cognition
#AI #minimalcognitivegrid #CognitiveAI #cognitivescience #cognitivesystems

The article provides good insights into how industry leaders such as Waymo, DeepMind, and Amazon demonstrate the transformative power of Reinforcement Learning (RL).

Takeaways:
➡️ RL drives autonomy and innovation across industries, but challenges like interpretability remain pivotal.
➡️ Hybrid systems that blend RL and symbolic reasoning hint at breakthroughs in high-level decision-making.

computer.org/publications/tech

IEEE Computer Society · Reinforcement Learning in Agentic Systems: This article explores the role of RL in agentic systems and showcases its transformative impact across industries.

Self-Improving Reasoners.

Both expert human problem solvers and successful language models employ four key cognitive behaviors (a toy tagging sketch follows the list):

1. verification (systematic error-checking),

2. backtracking (abandoning failing approaches),

3. subgoal setting (decomposing problems into manageable steps), and

4. backward chaining (reasoning from desired outcomes to initial inputs).
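
As a toy illustration of what these behaviors look like in a reasoning trace, here is a small keyword-based tagger. The cue phrases and function name are my own guesses for illustration only; this is not the classification pipeline used in the paper, just a rough way to eyeball traces.

```python
import re

# Hypothetical cue phrases for the four behaviors; illustrative guesses only,
# not the classifier used in the paper.
BEHAVIOR_CUES = {
    "verification": [r"let me check", r"verify", r"double-check", r"sanity check"],
    "backtracking": [r"that doesn't work", r"try a different", r"go back", r"start over"],
    "subgoal_setting": [r"first,", r"step \d", r"break .{0,20} down", r"next,"],
    "backward_chaining": [r"working backwards?", r"to end up with", r"from the target"],
}

def tag_behaviors(trace: str) -> dict[str, int]:
    """Count how often each cue family appears in a (lower-cased) reasoning trace."""
    trace = trace.lower()
    return {
        behavior: sum(len(re.findall(pat, trace)) for pat in patterns)
        for behavior, patterns in BEHAVIOR_CUES.items()
    }

if __name__ == "__main__":
    example = (
        "First, break the problem down: I need 24 from 3, 8, 7, 1. "
        "Let me check: 3 * 8 = 24, and 24 * 1 = 24. Verify the unused numbers... "
        "that doesn't work, go back and try a different split."
    )
    print(tag_behaviors(example))
```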

Some language models naturally exhibit these reasoning behaviors and show substantial gains, while others don't and quickly plateau.

The presence of reasoning behaviors, not the correctness of answers, is the critical factor: models trained on incorrect solutions that contain proper reasoning patterns achieve comparable performance to those trained on correct solutions.

It seems that the presence of cognitive behaviors enables self-improvement through RL.
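
For intuition, here is a minimal sketch of a self-improvement loop on a verifiable, Countdown-style task: sample attempts, verify them against the target, and keep the verified traces for the update step. The "model" is a random stub and the function names are mine; the actual RL or fine-tuning update studied in the paper is deliberately left out.

```python
import random

def propose_solution(numbers: list[int], target: int) -> str:
    """Stand-in for a model sample: a random arithmetic expression.
    (A real model would condition on the target; this stub ignores it.)"""
    a, b, c = random.sample(numbers, 3)
    op1, op2 = random.choice("+-*"), random.choice("+-*")
    return f"({a} {op1} {b}) {op2} {c}"

def verify(expr: str, target: int) -> bool:
    """Verifiable reward: does the expression evaluate to the target?"""
    return eval(expr) == target  # safe here: we built the expression ourselves

def self_improvement_round(numbers: list[int], target: int, n_samples: int = 200) -> list[str]:
    """Sample attempts and keep the verified ones; in practice these traces
    would drive an RL or fine-tuning update of the model."""
    attempts = [propose_solution(numbers, target) for _ in range(n_samples)]
    return [a for a in attempts if verify(a, target)]

if __name__ == "__main__":
    good = self_improvement_round([3, 8, 7, 1], target=24)
    print(f"{len(good)} verified attempts, e.g. {good[:3]}")
```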

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
arxiv.org/abs/2503.01307


arXiv.org · Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

Test-time inference has emerged as a powerful paradigm for enabling language models to "think" longer and more carefully about complex challenges, much like skilled human experts. While reinforcement learning (RL) can drive self-improvement in language models on verifiable tasks, some models exhibit substantial gains while others quickly plateau. For instance, we find that Qwen-2.5-3B far exceeds Llama-3.2-3B under identical RL training for the game of Countdown. This discrepancy raises a critical question: what intrinsic properties enable effective self-improvement? We introduce a framework to investigate this question by analyzing four key cognitive behaviors -- verification, backtracking, subgoal setting, and backward chaining -- that both expert human problem solvers and successful language models employ. Our study reveals that Qwen naturally exhibits these reasoning behaviors, whereas Llama initially lacks them. In systematic experimentation with controlled behavioral datasets, we find that priming Llama with examples containing these reasoning behaviors enables substantial improvements during RL, matching or exceeding Qwen's performance. Importantly, the presence of reasoning behaviors, rather than correctness of answers, proves to be the critical factor -- models primed with incorrect solutions containing proper reasoning patterns achieve comparable performance to those trained on correct solutions. Finally, leveraging continued pretraining with OpenWebMath data, filtered to amplify reasoning behaviors, enables the Llama model to match Qwen's self-improvement trajectory. Our findings establish a fundamental relationship between initial reasoning behaviors and the capacity for improvement, explaining why some language models effectively utilize additional computation while others plateau.