mathstodon.xyz is one of the many independent Mastodon servers you can use to participate in the fediverse.
A Mastodon instance for maths people. We have LaTeX rendering in the web interface!

#worldmodels

Tero Keski-Valkama<p>How can we formulate the exploration-exploitation trade-off better than all the hacks on top of the Bellman equation?</p><p>First of all, we can simply estimate the advantage of exploration by Monte Carlo in a swarm setting: pitting fully exploitative agents against fully exploitative agents that have the benefit of recent exploration. This can easily be done with lagging policy models.</p><p>Of course, the advantage of exploration needs to be divided by the cost of exploration, which is linear in the number of agents the swarm uses to explore at a particular state.</p><p>Note that the advantage of exploration depends on the state of the agent, so we might want to define an explorative critic to estimate it.</p><p>What's beautiful in this formulation is that we can incorporate autoregressive <a href="https://rukii.net/tags/WorldModels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WorldModels</span></a> naturally: the exploitative agents learn only from rewards, whereas the explorative agents choose their actions in a way that maximizes the improvement of the autoregressive world model.</p><p>It brings these two concepts together as two sides of the same coin.</p><p>Exploitation is action guided by reward; exploration is action guided by improvement of the autoregressive state-transition model.</p><p>Balancing the two is a swarm dynamic that encourages branching where exploration has an expected value in reward terms. 
This can be estimated by computing the advantage of exploitative agents that use recent exploration over agents that do not, and propagating this advantage back to the points of divergence between the two.</p><p><a href="https://rukii.net/tags/mathematics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>mathematics</span></a> <a href="https://rukii.net/tags/ReinforcementLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ReinforcementLearning</span></a> <a href="https://rukii.net/tags/RL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RL</span></a> <a href="https://rukii.net/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://rukii.net/tags/LLMs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>LLMs</span></a></p>
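<p>The Monte Carlo estimate described in the post above can be illustrated with a toy sketch. This is not the author's implementation: it assumes a single-state problem (a multi-armed bandit with Gaussian rewards), stands in for the "lagging policy model" with a stale table of value estimates, and lets explorers pick arms uniformly at random instead of maximizing world-model improvement. All names (<code>exploration_value</code>, <code>cost_per_explorer</code>, and so on) are hypothetical.</p>

```python
import random


def greedy_arm(estimates):
    """Fully exploitative policy: pick the arm with the highest estimate."""
    return max(range(len(estimates)), key=lambda a: estimates[a])


def mean_return(arm, true_means, n_rollouts, rng):
    """Monte Carlo estimate of the return from always pulling `arm`."""
    total = 0.0
    for _ in range(n_rollouts):
        total += rng.gauss(true_means[arm], 1.0)
    return total / n_rollouts


def explore(estimates, counts, true_means, n_explorers, rng):
    """Each explorer samples one random arm; returns updated copies."""
    new_est, new_counts = list(estimates), list(counts)
    for _ in range(n_explorers):
        arm = rng.randrange(len(true_means))
        reward = rng.gauss(true_means[arm], 1.0)
        new_counts[arm] += 1
        # Incremental running-mean update of the value estimate.
        new_est[arm] += (reward - new_est[arm]) / new_counts[arm]
    return new_est, new_counts


def exploration_value(estimates, counts, true_means, n_explorers,
                      cost_per_explorer=1.0, n_trials=200,
                      n_rollouts=20, seed=0):
    """Return (advantage, advantage / cost) of exploring with a swarm.

    advantage: mean return of a greedy agent using fresh estimates minus
    the mean return of a greedy agent using the lagged estimates.
    cost: assumed linear in the number of exploring agents.
    """
    rng = random.Random(seed)
    lagged_arm = greedy_arm(estimates)  # agent without recent exploration
    advantage = 0.0
    for _ in range(n_trials):
        fresh_est, _ = explore(estimates, counts, true_means,
                               n_explorers, rng)
        fresh_arm = greedy_arm(fresh_est)  # agent with recent exploration
        advantage += (mean_return(fresh_arm, true_means, n_rollouts, rng)
                      - mean_return(lagged_arm, true_means, n_rollouts, rng))
    advantage /= n_trials
    cost = cost_per_explorer * n_explorers
    return advantage, advantage / cost


if __name__ == "__main__":
    # Lagged estimates favour arm 0, but arm 1 is actually the best.
    adv, ratio = exploration_value(
        estimates=[0.5, 0.0, 0.0], counts=[1, 1, 1],
        true_means=[0.5, 2.0, 0.0], n_explorers=10)
    print(f"advantage={adv:.3f}  advantage per unit cost={ratio:.3f}")
```

<p>In a multi-state setting, the same ratio would have to be estimated per state, which is where the explorative critic mentioned in the post would come in; the sketch leaves that part out.</p>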
LavX News<p>Google DeepMind's Bold Leap into AI World Simulation</p><p>In a groundbreaking move, Google DeepMind is assembling a new team led by Tim Brooks to develop AI models capable of simulating the physical world. This initiative aims to push the boundaries of artif...</p><p><a href="https://news.lavx.hu/article/google-deepmind-s-bold-leap-into-ai-world-simulation" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">news.lavx.hu/article/google-de</span><span class="invisible">epmind-s-bold-leap-into-ai-world-simulation</span></a></p><p><a href="https://mastodon.cloud/tags/news" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>news</span></a> <a href="https://mastodon.cloud/tags/tech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>tech</span></a> <a href="https://mastodon.cloud/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtificialIntelligence</span></a> <a href="https://mastodon.cloud/tags/WorldModels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WorldModels</span></a> <a href="https://mastodon.cloud/tags/GoogleDeepMind" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleDeepMind</span></a></p>
Winbuzzer<p>Google DeepMind is creating a new team to develop AI world models, focusing on simulating real-world dynamics and AGI <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/DeepMind" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DeepMind</span></a> <a href="https://mastodon.social/tags/GoogleDeepMind" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleDeepMind</span></a> <a href="https://mastodon.social/tags/WorldModels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WorldModels</span></a> <a href="https://mastodon.social/tags/GenAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GenAI</span></a> <a href="https://mastodon.social/tags/AGI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AGI</span></a> <a href="https://mastodon.social/tags/Robotics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Robotics</span></a> <a href="https://mastodon.social/tags/Gaming" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Gaming</span></a> <a href="https://mastodon.social/tags/AIResearch" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIResearch</span></a> <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a> <a href="https://mastodon.social/tags/AIModels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AIModels</span></a> <a href="https://mastodon.social/tags/Alphabet" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Alphabet</span></a></p><p><a 
href="https://winbuzzer.com/2025/01/08/google-deepmind-forms-specialized-team-for-ai-world-models-xcxwbn" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/01/08/googl</span><span class="invisible">e-deepmind-forms-specialized-team-for-ai-world-models-xcxwbn</span></a></p>
PKPs Powerfromspace1<p>@wesroth </p><p>Ep 12-20-2024 🤖📰😅</p><p>Watch " <a href="https://mstdn.social/tags/Genesis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Genesis</span></a> Project Just UNLEASHED Legions of <a href="https://mstdn.social/tags/Robots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Robots</span></a> from SIMULATION to REALITY..."</p><p><a href="https://youtu.be/IAmrSaDW88I?feature=shared" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">youtu.be/IAmrSaDW88I?feature=s</span><span class="invisible">hared</span></a></p><p><a href="https://mstdn.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mstdn.social/tags/llm" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>llm</span></a> <a href="https://mstdn.social/tags/robotics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>robotics</span></a> <a href="https://mstdn.social/tags/worldmodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>worldmodels</span></a> combined with <a href="https://mstdn.social/tags/agi" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>agi</span></a> <a href="https://mstdn.social/tags/o3" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>o3</span></a><br>Coming soon 🔜 to <a href="https://mstdn.social/tags/skynet" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>skynet</span></a> v0.1 😈</p>
Joe GANIO-MEGO<p><span class="h-card" translate="no"><a href="https://xn--baw-joa.social/@unikonstanz" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>unikonstanz</span></a></span> </p><p>Pretty cool initiative!!!</p><p>If anybody is looking for inspiration on subjects, feel free to use and re-use any of my models of human population and competitiveness. They are fully open-science models, with code listings available here:</p><p><a href="https://osf.io/mg82f/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">osf.io/mg82f/</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/openscience" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>openscience</span></a> <br><a href="https://mastodon.social/tags/worldmodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>worldmodels</span></a> <br><a href="https://mastodon.social/tags/population" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>population</span></a></p>
Michal Valko<p>BYOL-Explore getting the apple! <a href="https://www.deepmind.com/publications/byol-explore-exploration-by-bootstrapped-prediction" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">deepmind.com/publications/byol</span><span class="invisible">-explore-exploration-by-bootstrapped-prediction</span></a> w/ Zhaohan Daniel Guo, Shantanu Thakoor, Miruna Pislar, Corentin Tallec, Florent Altché, Bernardo Avila Pires, Robin (Yunhao) Tang, Alaa Saade, Jean-Bastien Grill, Mohammad Gheshlaghi Azar, Bilal Piot, Remi Munos, Daniele Calandriello <a href="https://mastodon.social/tags/neurips2022" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>neurips2022</span></a> <a href="https://mastodon.social/tags/worldmodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>worldmodels</span></a> <a href="https://mastodon.social/tags/reinforcementlearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>reinforcementlearning</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a></p>
Shiwali Mohan | शिवाली मोहन<p>However, as soon as you start thinking about <a href="https://sigmoid.social/tags/language" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>language</span></a> as communication between <a href="https://sigmoid.social/tags/intelligent" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>intelligent</span></a> <a href="https://sigmoid.social/tags/agents" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>agents</span></a>, you have to start thinking about how language is connected with <a href="https://sigmoid.social/tags/worldmodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>worldmodels</span></a> </p><p>Lawrence Barsalou wrote about this phenomenon in 1999 while critiquing <a href="https://sigmoid.social/tags/language" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>language</span></a> <a href="https://sigmoid.social/tags/research" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>research</span></a> in <a href="https://sigmoid.social/tags/psychology" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>psychology</span></a>. He argued that <a href="https://sigmoid.social/tags/language" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>language</span></a> evolved in humans for coordinating collaborative actions in human teams. 
But, <a href="https://sigmoid.social/tags/psycholinguistics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>psycholinguistics</span></a> approached language as if its primary function was archival.</p><p><a href="https://web.archive.org/web/20031206190916id_/http://userwww.service.emory.edu:80/~barsalou/Papers/Disc_Proc_Files/disc_proc_98.pdf" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="ellipsis">web.archive.org/web/2003120619</span><span class="invisible">0916id_/http://userwww.service.emory.edu:80/~barsalou/Papers/Disc_Proc_Files/disc_proc_98.pdf</span></a></p>
Manuel Baltieri<p><a href="https://mathstodon.xyz/tags/Introduction" class="mention hashtag" rel="tag">#<span>Introduction</span></a> time.</p><p>TL;DR: maths to formally study <a href="https://mathstodon.xyz/tags/agents" class="mention hashtag" rel="tag">#<span>agents</span></a>, <a href="https://mathstodon.xyz/tags/worldmodels" class="mention hashtag" rel="tag">#<span>worldmodels</span></a> and <a href="https://mathstodon.xyz/tags/representations" class="mention hashtag" rel="tag">#<span>representations</span></a></p><p>I&#39;m currently a Chief Researcher at Araya, a Tokyo-based startup with the goal of understanding (artificial) <a href="https://mathstodon.xyz/tags/consciousness" class="mention hashtag" rel="tag">#<span>consciousness</span></a>.</p><p>My work focuses on general principles for the <a href="https://mathstodon.xyz/tags/origins" class="mention hashtag" rel="tag">#<span>origins</span></a> of <a href="https://mathstodon.xyz/tags/agency" class="mention hashtag" rel="tag">#<span>agency</span></a>, <a href="https://mathstodon.xyz/tags/life" class="mention hashtag" rel="tag">#<span>life</span></a> and <a href="https://mathstodon.xyz/tags/cognition" class="mention hashtag" rel="tag">#<span>cognition</span></a>. I previously used the <a href="https://mathstodon.xyz/tags/freeenergyprinciple" class="mention hashtag" rel="tag">#<span>freeenergyprinciple</span></a> (+ <a href="https://mathstodon.xyz/tags/activeinference" class="mention hashtag" rel="tag">#<span>activeinference</span></a>) as my main framework, but I&#39;m now looking into other directions, mostly using <a href="https://mathstodon.xyz/tags/categorytheory" class="mention hashtag" rel="tag">#<span>categorytheory</span></a> applied to <a href="https://mathstodon.xyz/tags/systemstheory" class="mention hashtag" rel="tag">#<span>systemstheory</span></a>.</p>
Michal Valko<p>And now it comes, presenting BYOL-Explore! <a href="https://arxiv.org/abs/2206.08332" rel="nofollow noopener noreferrer" target="_blank"><span class="invisible">https://</span><span class="">arxiv.org/abs/2206.08332</span><span class="invisible"></span></a><br>With Zhaohan Daniel Guo, Shantanu Thakoor, Miruna Pislar, Bernardo Ávila Pires, Florent Altché, Corentin Tallec, Alaa Saade, Daniele Calandriello, Jean-Bastien Grill, Robin (Yunhao) Tang, Rémi Munos, Mohammad Gheshlaghi Azar, Bilal Piot Comenius University in Bratislava DeepMind Photo (c) Stanislav Griguš <a href="https://mastodon.social/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>artificialintelligence</span></a> <a href="https://mastodon.social/tags/worldmodels" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>worldmodels</span></a> <a href="https://mastodon.social/tags/reinforcementlearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>reinforcementlearning</span></a></p>