2024 Generative AI Unveiling Next-Level Evolution

Since the advent of artificial intelligence technology in content production, the creation of texts, images, and audio through algorithms and models has progressed significantly. In 2024, the groundbreaking product Sora has once again swept the video domain.

In gaming, intelligent NPCs (non-player characters) that can interact and short videos that can be generated with a single text input, including dubbing and subtitles, have become possible due to improved hardware capabilities. The core algorithms of AIGC (generative AI) and the training levels of large models have continually advanced, better integrating into applications and scenarios.

Despite the significant success of AIGC in the interactive industry, challenges and issues remain, including legal risks and copyright disputes that need ongoing adjustments and improvements.

Imagination transformed into dynamic visuals marks the dawn of a new era in visual storytelling.

In the 1950s, as AI emerged and computer technology and software algorithms progressed, AI gradually became a hot topic in the technology field and an area of application. The content production model of the internet also evolved through three phases: PGC (Professional Generated Content), UGC (User Generated Content), and AIGC (AI Generated Content).

By 2023, innovations represented by AIGC reignited enthusiasm for the technology revolution, industrial empowerment, and technological innovation and entrepreneurship. People witnessed the rapid progress in text-to-text and text-to-image applications. AIGC applications penetrated various countries, industries, and scenarios at an astonishing speed, with the effectiveness of the new technologies being verified and confirmed in multiple areas, providing opportunities for people to complete existing tasks in entirely new ways.

At the start of 2024, OpenAI released the explosive video generation model Sora, capable of producing 60-second coherent videos based on prompt words alone, vastly surpassing the industry’s average video generation length of about 4 seconds.

Sora’s debut heralds a new era of visual storytelling, capable of transforming human imagination into vivid, dynamic visuals and turning the magic of words into a visual feast. In this future woven by data and algorithms, AIGC technologies like Sora are redefining interactions between people and the digital world in unique ways.

Penetrating medical finance drives multi-domain AI may become infrastructure.

AI’s potential to become a fundamental infrastructure like water, electricity, and networks is evident as it penetrates and transforms numerous industries. According to the “2024 AIGC Development Trends Report” released by multiple platforms including iDesign, in the medical field, AI applications can now perform precise diagnostic assistance. For example, Google Health’s deep learning models have surpassed human experts in breast cancer screening accuracy. These models, with their powerful image recognition and pattern analysis capabilities, can identify subtle abnormalities in thousands of X-ray images that might be overlooked by the human eye.

The financial sector has also undergone transformations driven by AI. Financial institutions use complex algorithms to predict market trends, manage risks, and even automate trading. Machine learning can analyze large volumes of historical data to identify patterns that are elusive to human observers. For instance, through deep learning, AI can detect minute market fluctuations in high-frequency trading and respond within milliseconds, a feat unmatchable by human traders.

In the automotive sector, AI applications in autonomous driving demonstrate the ability to perform tasks in highly complex and dynamic environments. Systems like Tesla’s Autopilot and Google’s Waymo use advanced sensor arrays and AI algorithms to achieve autonomous navigation and decision-making in vehicles, often surpassing human drivers in some scenarios.

In entertainment and creative industries, AI has opened new dimensions for human creativity. OpenAI’s DALL-E can generate astonishing images based on brief text descriptions, and AI music programs can create captivating melodies. These AI tools not only inspire creators but also transform the way art is made. This creative output of AI is unprecedented, contributing new elements to human cultural diversity.

AI “specializes” in producing virtual content, and the interactive entertainment industry is undergoing a major transformation.

The interactive entertainment industry, primarily focused on producing virtual content, has been significantly impacted by AIGC. A report by DianDian Data on the impact trends of AIGC in the interactive entertainment industry suggests that this sector, encompassing games, videos, web articles, and audio, has not only undergone structural changes in content production methods due to AIGC but also in the presentation of content itself.

Among the cases listed in the report, a domestic mobile game “Against the Current” featuring AIGC capabilities introduced “intelligent NPCs” for the first time. NPCs can autonomously interact, engaging not just in text-based chats but also in movements, gift-giving, trading, and even dueling, with players able to join the interactions at any time, thus influencing NPC behavior. Surprisingly, NPCs are equipped with “memory.” If you show kindness to an NPC, they might recognize you in a future encounter, whisper to you, or engage warmly in conversation about past interactions. Conversely, if a player antagonizes an NPC, the NPC might refuse to engage in further conversation and might even resort to combat.

In addition to intelligent NPCs, the game also deeply integrates AIGC features such as NPC voice generation, NPC speech lip-sync, and one-click player character creation (text input generates facial features), contributing significantly to player retention and generating high levels of discussion.

Another popular domestic game, “Egg Party,” added a “Creator Workshop” feature in its June 2023 anniversary update, offering three types of generation functions: a pixel art wall (uploaded pictures automatically generate pixel art), pixel text (input text generates pixel art), and flowers (input parameters to mass-produce flowers). With subsequent updates, the functionality of the generator continued to expand and improve, especially with the “Universal Generator” introduced in December last year, which again broke the ceiling of UGC creation. Players only need to input prompt words to generate corresponding items in the game and arrange them within scenes.

The well-known short video creation app “JianYing” now provides a one-stop service from text editing and video generation based on user input themes and key points to final video editing, including AI portrait, AI effects, and AI graphics tools to facilitate user-generated content. The app’s voice beautification feature helps users create digital audio avatars. These features, deeply integrated with AIGC, not only enhance creative efficiency but also lower the barriers to creation. While the effectiveness of individual features might currently struggle to match professional text and image generation software like ChatGPT, Midjourney, or Stable Diffusion, the integration of resources, ease of use, and consistency of generated materials for video editing provide users with a smooth and efficient practical experience.

The development of AI should respect human values, and laws and regulations are constantly being improved and perfected.

It is crucial to be aware that while AI has seen success in specific fields, it also faces limitations. As AI transitions from laboratory research to real-world scenarios, challenges related to technology, regulations, ethics, and societal acceptance become increasingly apparent. To fully leverage AI’s potential, finding a balance in these areas is essential to ensure that AI technologies can promote societal progress while respecting human values.

As more content is generated by AIGC, the accuracy and credibility of the content become critical issues. It is foreseeable that AIGC will incorporate more algorithms and mechanisms to address challenges such as misinformation, plagiarism, and other violations, ensuring the authenticity and legality of generated content.

On the corporate level, for instance, the gaming and software platform Steam issued a significant policy adjustment on January 10, 2024, regarding “AIGC+Games.” Games involving AIGC are categorized into “pre-generated content” and “real-time generated content,” with developers required to submit relevant information explaining how AIGC is used in game development and operation. This information is also disclosed to players, who can report any AIGC-related content at any time. This marks the first public statement by a leading game platform explicitly addressing its stance on AIGC-related products.

In addition to platforms like Steam continuously adjusting their strategies, countries worldwide are also continually refining relevant laws and regulations. In the context of common images in interactive entertainment products, South Korea has published “Generative AI Copyright Guidelines,” which clarify that AIGC-generated content is not protected by copyright. Conversely, the U.S. government considers works directly generated by AIGC as not legally protected, although works that involve collaboration with AIGC and reflect human creative achievements can be copyrighted.

In China, on July 13, 2023, the Cyberspace Administration of China and six other departments jointly issued the “Interim Measures for the Management of Generative Artificial Intelligence Services” aimed at promoting the healthy development and standardized application of generative AI, safeguarding national security and public interests, and protecting the legitimate rights and interests of citizens, legal entities, and other organizations. Cities like Beijing and Shanghai have also introduced related regulations and ordinances.

Currently, AI compliance issues mainly involve data, AI products, personal services, and industry applications. According to the International Data Corporation (IDC), within the framework of laws and regulations, all major model developers and AI ecosystem vendors must take legal and regulatory issues seriously, especially by optimizing algorithm models to enhance the explainability of artificial intelligence.

Related:

  1. Historic Computer Storage – Punch Cards: A Deep Dive
End-of-Yunze-blog

Disclaimer: This article is created by the original author. The content of the article represents their personal opinions. Our reposting is for sharing and discussion purposes only and does not imply our endorsement or agreement. If you have any objections, please contact us through the provided channels.

Leave a Reply