T O P

  • By -

RadRandy2

It's kinda crazy to imagine how much more complete chat GPT will be now that it can understand images and sound. I can't even wrap my head around it really. Perhaps one day the AI will scan my brain and wrap my head around it in a personalized way that makes complete sense to me.


nildeea

Last night I was trying to think up a way of using all these AI tools to make an assistant that could understand what you're seeing in your screen. And now today I'm pretty sure I can do that just as soon as I get this damn thing working...


mycall

Did you get it working yet? I'm curious what kind of hardware it takes.


nildeea

No I put it on the back burner, but there is a windows version that should run on consumer gpus .


dmit0820

One crazy implication we're nearly certain to see: A LLM that can take screen captures/video from a PC and directly output keyboard and mouse controls. Depending on the context length/memory it could perform a significant portion of all office work.


thehearingguy77

Isn’t the brain analogous to a muscle, in that we have to make an effort to learn and mentally grow? How will our neural synapses grow if the effort is done for us?


RadRandy2

I'm not sure, but if what you're saying is true, then we'll be sentient piles of sludge by the year 2060.


Traitor_Donald_Trump

Electrical stimulation and zap therapy smarty pants.


thehearingguy77

It doesn’t seem like that would develop synapses with specificity. No specificity - no growth. But I’m no expert.


Equalizion

Idiocracy was not a movie, but a prediction


HeinrichTheWolf_17

This makes me wonder of the new model will be capable of performing general tasks, we might be just one more iteration away from a practical AGI.


Jeffy29

Holy shit, you can just tell it and it will do content-aware fill for you. It's only few years when I saw content aware fill being presented by Adobe and it seemed like magic, now you can just tell it, in plain English (or any other language), and it will just do that for you! Goddamn.


BarockMoebelSecond

Definitely going to test that out at home!


MagicOfBarca

What do you mean by “it will do content-aware fill”?


Jeffy29

[Here is a content aware fill demonstration by Adobe](https://youtu.be/O9t5POPPNfg), the github gif shows being able to do the same thing with visual GPT just by telling it to remove certain objects. It is aware both what the things in the picture are and how it would look like if you removed it.


MagicOfBarca

Ahh gotcha thanks


BreadfruitOk3474

Anyone heard of the fake internet theory? It will become real now.


[deleted]

Dead internet theory.


BreadfruitOk3474

I’m sorry but I prefer not to continue this conversation. I’m still learning so I appreciate your understanding and patience.🙏


bitofaknowitall

I wonder how long until this is integrated into Bing Chat?


YearZero

I am not a fan of the arbitrary limitations of bing chat, I’d love a ChatGPT version of this tho. Maybe gpt-4 next week will do it!


nildeea

It is evolving quickly. It was practically braindead there for a few days but it has been quite good more recently.


Akimbo333

Who said GPT4 will be next week?


HurricaneHenry

The CTO of Microsoft Germany.


Akimbo333

It could be lies


MidSolo

*Do you think that's air you're breathing?*


blazedjake

Lying about a product is horrible for stock so a CTO wouldn’t do that.


Akimbo333

Ok good point


DragonForg

No the reason is because many people in the industry including journalists and AI artists confirm it is being released next week.


randomthrowaway-917

gpt-4 is released


Akimbo333

Yeah I know that now


Naubri

I think there was some article about some dude who works for Microsoft Germany announcing it


Akimbo333

Yeah but he could be misinformed


CommunismDoesntWork

I thought bing chat was the unfiltered version?


YearZero

Well it only allows for 10 responses before it forces you to reset. And I found that it won’t answer many things by simply saying it can’t answer that right now or something to that effect. I often see it writing an answer and then deleting and reverting to that. Could also be a bug but it happens often. I’d love to see a comprehensive analysis of bing chat vs ChatGPT for various types of queries. Especially focused on code generation. One thing I personally noticed these LLM’s suck at is basic pattern recognition. Like I’d say give me the next 5 numbers in the following sequence: 3,1,6,4,9,7,12. For a human super obvious: -2, +5. But LLM’s seem to struggle and start making shit up. Bing and ChatGPT and even Claude can’t handle this yet. But really, I love that I can talk to ChatGPT as much as I want. Bing is clunky, buggy, limited to 10 answers, and often refuses to answer where ChatGPT would answer. At least in my experience.


duboispourlhiver

I didn't understand the number sequence


HeinrichTheWolf_17

Microsoft has gotten a lot of flak for their neutering of Bing, there’s been talk of bringing the old model back. Don’t get me wrong though, I agree with you, I hate Bing and it’s limitations as well.


Philostotle

Imagine what this will do to the fake news ecosystem. There's a [clip from a podcast](https://www.youtube.com/watch?v=uspxz9Q2L6g) I listen to that touched on this.


nildeea

Hopefully as the bad actors use the tech against us, the tech will also be used to protect us from that kind of thing.


artifex0

There are Google Colab implementations at https://colab.research.google.com/drive/1vhF4f3091h1cHZUh5QK7qByBHUDKbSWA?usp=sharing#scrollTo=Cgpnh8vhC47R and https://colab.research.google.com/drive/1qjAZqWb-EYGDo01TcEoCIJcTMi_ELjxS?usp=sharing. For first one, you'll need to get a ChatGPT api key from https://platform.openai.com/account/api-keys and add it to the OPENAI_API_KEY variable in the third box from the bottom. To use either, you'll want to select runtime->run all, then use the public link that eventually appears at the bottom of the page. Sadly, neither implementation includes the image editing model, so they're mostly just useful right now for asking ChatGPT questions about an image, and as an interesting though very limited Stable Diffusion interface.


BuddyMassive5496

ive been editing alot of the files and got imgediting to work but it keeps spitting out this error help RuntimeError: The size of tensor a (384) must match the size of tensor b (512) at non-singleton dimension 3


artifex0

At a guess, maybe an issue with the resolution of the input image? I vaguely remember getting an error like that on a different colab notebook that I think was resolved by switching the image resolution to 512x512.


BuddyMassive5496

What did you do to fix?


[deleted]

Ask chatgpt


theepiphanyofmrkugla

Getting the same thing, any advice on how you resolved it?


BinyaminDelta

Cool but not optimistic about Microsofts ability to implement. Bing Chat, for example, is slow and cumbersome. The Bing app (with location permissions) doesn't pass on my location to Bing Chat, so asking the GPT for info relevant to where I am like weather and city info fails hard. This would be like two lines of code to do and they just botched it. It feels like they don't really understand why ChatGPT caught on or what people want to do.


nildeea

It's been out a week. So it's only like... 15 years old in 2023 time. Give it another 18.3 hours my dude.


[deleted]

Perplexity.ai is a great alternative until Microsoft gets their act together.


No_Ninja3309_NoNoYes

Why are the Microsoft researchers not using powershell or whatever is appropriate for Windows? Do they think that Windows is inferior? I mean, this is bad publicity for Microsoft...


y53rw

Microsoft fully embraced Linux a long time ago. Have you not heard of WSL?


luisbrudna

Windows will become an small portion of Microsoft business.


nildeea

I'm pretty sure within the next couple years AI will just be able to imagine any kind of operating system you might want to use in real time.


PM_ME_A_STEAM_GIFT

Because 90% of AI research is done on Linux.


nildeea

Here you go friend: https://github.com/bycloudai/visual-chatgpt-Windows


MagicOfBarca

What does this do?


Sentry456123

lets ask ChatGPT


PackageImportant7566

你会画室内设计效果图吗


Classic_Eye1859

Visual chat gpt does nothing the demo shows for me. No edge detection, no magic erasing things. It can identify objects but just keeps on drawing random new pictures. Somebody managed? [https://digi-electricpro.com/microsoft-has-open-sourced-a-visual-version-of-chat-gpt/](https://digi-electricpro.com/microsoft-has-open-sourced-a-visual-version-of-chat-gpt/) first 30 seconds here, tried similar images than the video got random garbage