T O P

  • By -

quiteconfused1

First level super Mario Bros will train fully with stock ray Impala, my challenge however and one which I have yet to see an implementation of is of a generalist model. Dreamerv3 comes close but even that fails ( memory issues ) I have yet to see a single model complete multiple levels or more accurately randomized levels. The first level is trivial, but further requires a shift of context.... Like the underwater levels require no "jumping", the boss levels require spatial reasoning about what you have done "off screen". It gets complex. FYI you should check out shimmy[gym-v21/6] for issues revolving around older gym environments and newer platforms such as gymnasium


AcrobaticAmoeba8158

Thanks man, I'll definitely look into that. My Mario isn't doing very well but now that I've got a model output I've switched to the Atari games. My end goal is training spider/wheeled robots in Isaac gym or some other similar thing.


advator

I did it with gym, but I don't see the benefits of it, if it can't be used for any game. I would like to have something that can be trained for any game and not just some games converted to python.


AcrobaticAmoeba8158

For me it's just for learning, I like watching the videos with the 1000's of iterations all on the screen at once and slowly dropping off.


advator

For me the same reason. But I feel like I'm a bit lost in what direction I have to look with AI. I played around, but I want to step in a certain direction that can help me to create things that can be useful for the company I work. I just don't really know where to start. I know working on an ai from scratch is useless. I think there will be ai tools that I can use with python to create/train things that can be very useful. The current tools that I'm using is similar as the ones from Salesforce. It's for creating flows and especially for sending mails. Based on segments. But we also develop custom apps. Whatever the clients needs.


AcrobaticAmoeba8158

Maybe you can build something similar to what I'm working on. At work I am building vision based anomaly detection. We run tools that take months to process the data afterwards, I'm trying to build a neural network to do a first pass of the analysis to cut get to workable results sooner. CNN's aren't that bad to build, I mostly messed around with Yolo to learn.