Difference between revisions of "Video/Image"

From
Jump to: navigation, search
m (MidJourney)
m (Video Synthesis)
Line 182: Line 182:
 
= <span id="Video Synthesis"></span>Video Synthesis =
 
= <span id="Video Synthesis"></span>Video Synthesis =
  
 +
* [https://www.synthesia.io/ Synthesia] ... Create professional videos without mics, cameras, or actors,
 +
turn your text into high-quality videos with AI avatars and voiceovers — in over 120 languages.
 
* [https://www.papercup.com/ Papercup] ... go global with your existing video content using AI dubbing.
 
* [https://www.papercup.com/ Papercup] ... go global with your existing video content using AI dubbing.
 
* [https://arstechnica.com/gaming/2016/06/an-ai-wrote-this-movie-and-its-strangely-moving/ Sunspring - Movie written by algorithm]
 
* [https://arstechnica.com/gaming/2016/06/an-ai-wrote-this-movie-and-its-strangely-moving/ Sunspring - Movie written by algorithm]
Line 238: Line 240:
 
<youtube>5zlcXTCpQqM</youtube>
 
<youtube>5zlcXTCpQqM</youtube>
 
<youtube>8AZBuyEuDqc</youtube>
 
<youtube>8AZBuyEuDqc</youtube>
 
  
 
= <span id="Video Editing"></span>Video Editing =
 
= <span id="Video Editing"></span>Video Editing =

Revision as of 14:54, 9 August 2023

YouTube ... Quora ...Google search ...Google News ...Bing News


Look here

Text to Image

Firefly

Youtube search... ...Google search ...Google search

YouTube ... Quora ...Google search ...Google News ...Bing News

Adobe Firefly is a creative generative AI engine that is part of Adobe Sensei’s Generative AI services. It is available in Photoshop (beta), Illustrator, Adobe Express, and on the web. With Firefly, you can dream it, type it, and see it. You can use text prompts to generate custom vectors, brushes, textures, images, videos, and 3D objectsFirefly includes various image modification tools such as content type, color, tone, lighting, and composition.

  • Generate Fill ... Use a brush to remove objects, or paint in new ones from text descriptions

  • Text Effects ... Apply styles or textures to text with a text prompt.

  • Generative recolor ... Generate color variations of your vector artwork from a detailed text description.

Stable Diffusion

Youtube search... ...Google search ...Google search

YouTube ... Quora ...Google search ...Google News ...Bing News

A latent text-to-image diffusion model capable of generating photo-realistic images given any text input. Designing and implementing solutions using collective intelligence and augmented technology.

DALL-E

Youtube search... ...Google search ...Google search

DALL·E is a AI system that can create realistic images and art from a description in natural language. We currently support the ability, given a prommpt, to create a new image with a certain size, edit an existing image, or create variations of a user provided image.

The current DALL·E model available through our API is the 2nd iteration of DALL·E with more realistic, accurate, and 4x greater resolution images than the original model. You can try it through the our Labs interface or via the API.


MidJourney

Youtube search... ...Google search ...Google search

Runway Now Turns Midjourney Images into Videos: Generative video darling Runway has released a Gen 2 update with a feature that has really captured AI Twitter’s attention: the ability to automatically animate Midjourney images. This was a feature Pika Labs had also previously released, showing how competition is driving things forward.

Image GPT

Youtube search... ...Google search ...Google search

NVIDIA Canvas

Image to Image

  • Extrapolate ... See how well you age with AI, curious how you'll look in 10 years? 20 years? When you're 90? Upload a photo and find out!


  • Scribble Diffusion ... turns any hand sketches into images; sketch anything you want, provide a small description

Video Synthesis

  • Synthesia ... Create professional videos without mics, cameras, or actors,

turn your text into high-quality videos with AI avatars and voiceovers — in over 120 languages.



Text-to-Video

Picasso

Picasso | NVIDIA a cloud service that allows you to build generative AI-powered visual apps, is available. Software creators, service providers, and enterprises can run inference on models, train NVIDIA Edify foundation model models on proprietary data, and start from pre-trained models to create image, video, or 3D content from text prompts.



Watch Me Forever

“Watch Me Forever” is a channel on Twitch that streams AI-generated content. One of their streams is an AI-generated version of Seinfeld. The Twitch channel uses generative artificial intelligence to create an infinite stream of content. For example, their AI-generated version of Seinfeld called “Nothing, Forever” uses text generation from OpenAI’s GPT-3 models and speech from Azure Cognitive Services. They also have proprietary generative algorithms that they collectively call the ‘director,’ which is responsible for making sure all the individual pieces come together into a whole.


Image-to-Video

Youtube search... ...Google search

"The draw here is that visual imagery is visceral and compelling and we respond to it," says Hany Farid, associate dean and head of the School of Information at UC Berkeley. "We are visual beings. When you see your grandmother or Mark Twain come alive, there's something fascinating about it." A new program can animate old photos. But there's nothing human about artificial intelligence | AJ Willingham - CNN

Deep Nostalgia™ allows you to animate the faces in your old family photos. Utilizing state-of-the-art deep learning technology licensed by MyHeritage from D-ID, Deep Nostalgia™ creates high-quality, realistic video footage from still photos.



Video-to-Video

Youtube search... ...Google search


Video Editing

Youtube search... ...Google search


In the last few years, artificial intelligence (AI) and machine learning (ML) have both started to feature more prominently in technology. That is especially the case in video editing, where artificial intelligence is being integrated with more and more ways. How Artificial Intelligence is Transforming Video Editing | TechnologyHQ


Captioning

Youtube search... ...Google search

Image Captioning Through Neural Networks
An image captioning model using VGG16 feature extraction (CNN) and LSTM (RNN) neural networks. With Python, [TensorFlow]] and Keras

Automated Image Captioning with ConvNets and Recurrent Nets
Andrej Karpathy and Fei-Fei Li ...http://cs.stanford.edu/people/karpathy/sfmltalk.pdf

A Simple Way to Automatically Transcribe Video/Audio to Text
Here is how to auto generate subtitles from any video with Google docs. It also works if you want to convert audio to text. Useful for creating subtitles and closed Captions for all your Youtube videos. The trick here is to make the recording and playback devices same. This will give good quality and help you automatically transcribe video to text.

I compared 3 AI Image Caption Models - GIT vs BLIP vs ViT+GPT2 - Image-to-Text Models
I took10 different images to compare GIT, BLIP and ViT+GPT2, 3 state-of-the-art vision+language models.

  • GIT: A Generative Image-to-text Transformer for Vision and Language
  • BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
  • ViT+GPT2 - Image Captioning using transformers

Level Up - Automated Subtitles with AI
Welcome to Level Up! the show where we show you how to build solutions hands-on with Google Cloud Platform. In this episode, Solutions Architect Markku Lepistö will show you how to create subtitles for videos using Cloud AI services. And then how to translate the subtitles to quickly add support for multiple languages!

00:35 - Example of extracting the dialog as an audio track 01:15 - Enable Google Cloud AI services 02:37 - Coding the client app in Python 09:20 - Reviewing the results 09:48 - Uploading subtitles to YouTube Studio 10:44 - Testing the results

AI-Driven Image Captioning For Inclusive Productivity
Advances in hybrid intelligence, deep learning, and related artificial intelligence techniques have provided us with a remarkable opportunity to ensure the future of work will be even more inclusive to more people than ever before. Because the communication and products of work increasingly comprise images—photos, charts, maps, and the like—that are often not accessible, people who are blind or low vision face unique challenges. One promising technology is the automated understanding and captioning of images. Microsoft Office 365 applications, for example, can use APIs from Microsoft Azure Cognitive Services to automatically add alt text to images. But there remain many hurdles to making these captions truly useful and usable. In this breakout session, we will explore the state of the art and potential for advancement in automated image captioning, including data capture and curation for training, caption presentation and interactivity, and computer vision.

Microsoft AI breakthrough in automatic image captioning
Microsoft researchers have built an artificial intelligence system that can generate captions for images that are, in many cases, more accurate than what was previously possible. The breakthrough is a milestone in Microsoft’s push to make its products and services inclusive and accessible to all users.

Pytorch Image Captioning Tutorial
In this tutorial we go through how an image captioning system works and implement one from scratch. Specifically we're looking at the caption dataset Flickr8k. There are multiple ways to improve the model: train a larger model (the one used is relatively small), train for longer and improve the model by adding attention similar to this paper: http://arxiv.org/abs/1502.03044. Video of dataset (link in that video description to download the dataset yourself): http://youtu.be/9sHcLvVXsns Support My Channel Through Patreon: http://www.patreon.com/aladdinpersson

Style-Based Generator Architecture

Runway

Runway ... can edit videos in real-time, collaborate, and use more than 30 AI magic tools.

Tool-related

Youtube search... ...Google search

Image or Video Forgeries

Youtube search... ...Google search

I built a Secret AI Youtube Channel

I challenged myself to earn $100 in the first 3 days of starting a secret AI generated Youtube channel! I didn't use any of my existing connections to promote the channel and used AI tools like ChatGPT, MidJourney, and D-ID every step of the way to automate my own job as a Machine Learning Educator. I've open sourced all the code i used because I want more people to do this, AI avatars are going to be on social media alongside humans, building followers and influencing society. Was I successful in my goal? Find out in this video!