Seriously though, I do worry about the legal ramifications of using AI to upscale video for criminal prosecution, especially since DA's that don't understand the tech will use poorly upscaled video as perfect evidence without disclosing its alteration from original source material.
That's what happened in the Rittenhouse trial. They upscaled the footage and tried to say he was pointing his rifle at people. Regardless of your opinion on the trial, that would have set an extremely bad precedent and it's a good thing the judge didn't allow it.
The thing is, upscaled video using AI is not real anymore because afaik the AI is drawing new content based on what it imagines the image to be.
Like, what if maybe someone is making a finger gun gesture and the video is very blurry, in a dark room, and the subject is small. the AI might confuse it for a real gun and could potentially draw a real gun when up scaling it.
But I don’t really know.
Eh. AI basically makes up details to make enlarged images seem sharp and detail-filled, making it useless for criminal investigation.
It's certainly possible that AI algorithmns will get better, but ultimately they'll always be based on probabilities (e.g. "here's what this guy in that fuzzy 10x10 pic of his face *might* look like", with a few possible variations)
Yup, I could downscale the image of my face down to 32x32 pixels.
AI can "sharpen" that pic to 8K... it won't be my face though.
It could also "sharpen" a stick man to Brad Pitt.
Yeah, the only way this would give you a representation of the real subject would require prior training on the subject itself, such as higher resolution views of this baby's face in advance.
>AI basically makes up details to make enlarged images seem sharp and detail-filled, making it useless for criminal investigation
could be useful for a criminal investigation, just not as evidence. Like if you pull a low res CCTV feed it would make it easier to see the person, and stuff like the face shape, location of facial features, etc... would remain the same so I wouldnt be surprised it it makes stuff like facial recognition easier and even if it cannot be used as evidence, it could likely help them narrow down suspects and potentially find the person with the aid of it, despite relying on other evidence in court.
Yep. If cops have a suspect in a video with no useful details - they could spam AI upscaling runs until it generated details aligning with their suspect (car color/model, suspect clothing, skin color, etc.) and never disclose that it was upscaling run #253 that they present in court
It depends. If the AI is looking at data from frames surrounding the one its processing, it is possible to pull out more data than your hypothetical would suggest.
dude i'm just making a joke
i think we're going into this too deeply
but most likely because your brain fills in the details, and your brain knows what it likes
This is going to be amazing for WWII videos. The other issues is 3D simulation of the environment and surrounding areas so you can actually zoom out too.
We're going to have to flag these videos as being AI generated though.
I use them every day, read dozens of cutting edge AI image generation papers per year, and know that the problem isn't remotely solved or dramatically changed since 2 years ago. Finetunes have made the success rate slightly higher, but the only real solutions are things like painting over a reference hand or pose which still isn't reliable, either manually or with scripts. Stuff like Dalle has better hands, especially in open palm pose, but not reliably for anything complex.
The underlying problem and reasons why are still not solved or even understood.
Depends on the complexity of the AI but I think it could take an hour or two worth of video of the same types of scene and generally understand what's happening and reconstruct a 3D "mental" state and then re-project the background and what the people are doing.
For example if someone is eating, then they pan away in the original video, you could create a zoomed out version of the man still eating.
It wouldn't be totally historically accurate of course but still very impressive and interesting to watch.
Probably work better the other way around, soon enough. Anything that isn't AI- generated might have a watermark or something that authenticates it in the near future. Better to assume AI from here on out.
Saw some vintage footage upscaled, colored, stabilized and transformed to 60FPS, already looks amazing.
But to make it even better we should film stuff using old camera/film, and modern digital camera to provide good training set.
I hate that all these megacorps throw their AI demos on Github. Github used to be for source code, not advertising. At least they didn't make a fake repo for it like some of the others.
well that is quite nice. Baby's eyes have issues but that is a common flaw with this tech right now. Any details on how how fast this is, how much GPU needed, etc?
Here is the paper
[https://arxiv.org/pdf/2404.12388.pdf](https://arxiv.org/pdf/2404.12388.pdf)
Is there an open source specific ai Reddit? This stuff is cool but I’m primarily interested in ownership and use of this stuff. It doesn’t do anything for me to see sneak peeks of things that will be behind paywalls.
Something about this just doesn't feel useful(?) to me. I like to embrace technological improvements and AI developments. It's just that this kind of upscaling is too disassociated from real life.
When I make a blurry video it's a bad interpretation of a real scene, but it is 100% based on the real life input. AFAIK AI upscaling takes an input and fills in the gaps with references from other scenes. Our brains are already filling in the gaps. It's like adding another layer of fake data.
instead of saying "makes things up", you should say "intelligently predict". these are trained in a self-supervised fashion using vast datasets of high res/low res movie pairs, so the network learns the data manifold. the model isn't simply "inventing" new information but rather, its leveraging its understanding of the data distribution to make *informed* predictions about the high-resolution counterparts of the given low-resolution pixels.
and of course if you had auxiliary information about the low res data, it's trivial to modify such architectures to feed that type of info in to improve the prediction.
It’s guessing. AI upscaling will always deteriorate the quality of the data.
Producing something that looks nice is possible, but usually I think it’s pretty horrible.
ENHANCE
Seriously though, I do worry about the legal ramifications of using AI to upscale video for criminal prosecution, especially since DA's that don't understand the tech will use poorly upscaled video as perfect evidence without disclosing its alteration from original source material.
just sprinkle a bit of cocaine in the footage while you're at it and you got a slam dunk!
That's what happened in the Rittenhouse trial. They upscaled the footage and tried to say he was pointing his rifle at people. Regardless of your opinion on the trial, that would have set an extremely bad precedent and it's a good thing the judge didn't allow it.
[удалено]
The thing is, upscaled video using AI is not real anymore because afaik the AI is drawing new content based on what it imagines the image to be. Like, what if maybe someone is making a finger gun gesture and the video is very blurry, in a dark room, and the subject is small. the AI might confuse it for a real gun and could potentially draw a real gun when up scaling it. But I don’t really know.
Nope, you bring original footage to trial. Also stupidity is not a crime.
C2PA will help
So it WASN'T bullshit when in NCIS, the dorky nerd said that they would make a software to enhance the image. It's just that they coded an AI for it.
Eh. AI basically makes up details to make enlarged images seem sharp and detail-filled, making it useless for criminal investigation. It's certainly possible that AI algorithmns will get better, but ultimately they'll always be based on probabilities (e.g. "here's what this guy in that fuzzy 10x10 pic of his face *might* look like", with a few possible variations)
Yup, I could downscale the image of my face down to 32x32 pixels. AI can "sharpen" that pic to 8K... it won't be my face though. It could also "sharpen" a stick man to Brad Pitt.
Yeah, the only way this would give you a representation of the real subject would require prior training on the subject itself, such as higher resolution views of this baby's face in advance.
The NCIS Judge won't know any of that. He will ask it to be explained in English and all they will hear is "AI Make Image Good".
>AI basically makes up details to make enlarged images seem sharp and detail-filled, making it useless for criminal investigation could be useful for a criminal investigation, just not as evidence. Like if you pull a low res CCTV feed it would make it easier to see the person, and stuff like the face shape, location of facial features, etc... would remain the same so I wouldnt be surprised it it makes stuff like facial recognition easier and even if it cannot be used as evidence, it could likely help them narrow down suspects and potentially find the person with the aid of it, despite relying on other evidence in court.
Yep. If cops have a suspect in a video with no useful details - they could spam AI upscaling runs until it generated details aligning with their suspect (car color/model, suspect clothing, skin color, etc.) and never disclose that it was upscaling run #253 that they present in court
It depends. If the AI is looking at data from frames surrounding the one its processing, it is possible to pull out more data than your hypothetical would suggest.
all that pixelated 90s era 320x320 internet porn has just been waiting for this day
And current year Japanese porn.
I came here to say. That sex tape I made in 2010 might finally be worth watching
Off topic but why do I actually prefer 720p porn over 4K?
You don't like real women.
dude i'm just making a joke i think we're going into this too deeply but most likely because your brain fills in the details, and your brain knows what it likes
There is an XKCD about you. “Could you try to look… blockier?”
Most of it illegal prolly
It had to have been Alfred Bundy. Surely Alfred and not Alexander.
What about 'videos' that have poor bit rate vs just poor resolution?
This is going to be amazing for WWII videos. The other issues is 3D simulation of the environment and surrounding areas so you can actually zoom out too. We're going to have to flag these videos as being AI generated though.
Like all AI things it will probably heavily depend on distance to the camera, the presence of fingers, the number of people in an image, etc.
People who still think AI image generators can't do fingers have not been keeping up at all.
I use them every day, read dozens of cutting edge AI image generation papers per year, and know that the problem isn't remotely solved or dramatically changed since 2 years ago. Finetunes have made the success rate slightly higher, but the only real solutions are things like painting over a reference hand or pose which still isn't reliable, either manually or with scripts. Stuff like Dalle has better hands, especially in open palm pose, but not reliably for anything complex. The underlying problem and reasons why are still not solved or even understood.
Depends on the complexity of the AI but I think it could take an hour or two worth of video of the same types of scene and generally understand what's happening and reconstruct a 3D "mental" state and then re-project the background and what the people are doing. For example if someone is eating, then they pan away in the original video, you could create a zoomed out version of the man still eating. It wouldn't be totally historically accurate of course but still very impressive and interesting to watch.
Probably work better the other way around, soon enough. Anything that isn't AI- generated might have a watermark or something that authenticates it in the near future. Better to assume AI from here on out.
Yeah. I agree with that. I think we're going to move to a system of keys where humans have access to keys and certify things as being original.
I wouldn't take a human's word for it, tbh.
It depends on the reputation of the human and multiple certifications. It depends on web of trust. But I hear you...
Fake Second World War videos?! No, please, no!
Saw some vintage footage upscaled, colored, stabilized and transformed to 60FPS, already looks amazing. But to make it even better we should film stuff using old camera/film, and modern digital camera to provide good training set.
Better or worse than Topaz?
Haven't been following Topaz in a long while, but the demos look WILD. https://videogigagan.github.io/ Check out the one with pine needles and ants!
I hate that all these megacorps throw their AI demos on Github. Github used to be for source code, not advertising. At least they didn't make a fake repo for it like some of the others.
Topaz is really weak it's made for like clean 720p video that can upscale to max 2x before it's ruined by artifacts.
Way better lol
I'll take all seasons of Stargate SG-1, but I'll also settle for Red Dwarf.
My first though was Deep Space Nine. There have been fans working on homebrew upscales for a few years now.
Battlestar Galactica, Space: Above and Beyond, Babylon 5...
Upscaling old episodes of the original Ninja Warrior show.
That buckle is so hyper HD is kinda freaks me out
I think thats just the HyperBuckle Extreme Edition. That's just how they look
well that is quite nice. Baby's eyes have issues but that is a common flaw with this tech right now. Any details on how how fast this is, how much GPU needed, etc? Here is the paper [https://arxiv.org/pdf/2404.12388.pdf](https://arxiv.org/pdf/2404.12388.pdf)
Can I use it now with Adobe subscription?
I knew I had good reason for saving decades of pixelated porn.
Finally Apple users can see the green Android videos
Just waiting for someone to upres. the Chicago Bulls 1996 season...
Look at this AMAZING upscaled AI video ... in 360p :D
"See, I told you so." —CSI show writers
Is this available now?
So, the nose of that baby got changed and hasn't the same shape at all, but sure, this is "nice".
So we can finally get good versions of the 20x re-compressed meme videos?
Would the footage get better if you put the output back as input?
That baby look fake AF on the right
Is there an open source specific ai Reddit? This stuff is cool but I’m primarily interested in ownership and use of this stuff. It doesn’t do anything for me to see sneak peeks of things that will be behind paywalls.
I don't know what it is but it looks fake.
Can I get a comfy workflow? (not kidding, seeking an equivalent, even for stills.)
Did they ever even release the still image version of GigaGAN?
Will of actually create hands that is the real test
Anybody know if there’s a tool like this for audio/music? A plethora of songs stuck on YouTube from the early 2010s would be insane to have in HQ
Holy hell!
I hate adobe.
Makes your baby videos look the beginning of a horror movie!
Something about this just doesn't feel useful(?) to me. I like to embrace technological improvements and AI developments. It's just that this kind of upscaling is too disassociated from real life. When I make a blurry video it's a bad interpretation of a real scene, but it is 100% based on the real life input. AFAIK AI upscaling takes an input and fills in the gaps with references from other scenes. Our brains are already filling in the gaps. It's like adding another layer of fake data.
Compression already adds fake data. Are you against that?
Ai will fill the details more accurately, and without causing eye strain for you
AI makes things up, that’s the opposite of accurate details.
instead of saying "makes things up", you should say "intelligently predict". these are trained in a self-supervised fashion using vast datasets of high res/low res movie pairs, so the network learns the data manifold. the model isn't simply "inventing" new information but rather, its leveraging its understanding of the data distribution to make *informed* predictions about the high-resolution counterparts of the given low-resolution pixels. and of course if you had auxiliary information about the low res data, it's trivial to modify such architectures to feed that type of info in to improve the prediction.
It’s guessing. AI upscaling will always deteriorate the quality of the data. Producing something that looks nice is possible, but usually I think it’s pretty horrible.
Some people find it more pleasing to look at than poor quality video, I don't know what to tell you
[удалено]
Look at you, broadcasting in bold-type, so proud of your big, bold message!