All Aboard: NVIDIA Scores 23 World Records for Route Optimization

All Aboard: NVIDIA Scores 23 World Records for Route Optimization

With nearly two dozen world records to its name, NVIDIA cuOpt now holds the top spot for 100% of the largest routing benchmarks in the last three years. And this means the route optimization engine allows industries to hop on board for all kinds of cost-saving efficiencies.

Kawasaki Heavy Industries and SyncTwin are among the companies that are riding cuOpt for logistics improvements.

Today at GTC 2024, NVIDIA founder and CEO Jensen Huang announced that cuOpt is moving into general availability.

“With cuOpt, NVIDIA is reinventing logistics management and operations research. It is NVIDIA’s pre-quantum computer, driving transformational operational efficiencies for deliveries, service calls, warehouses and factories, and supply chains,” he said.

The NVIDIA cuOpt microservice, part of the NVIDIA AI Enterprise software platform, makes accelerated optimization for real-time dynamic rerouting, factory optimization and robotic simulations available to any organization.

Companies can embed cuOpt into the advanced 3D tools, applications and USD-based workflows they develop with NVIDIA Omniverse, a software platform for developing and deploying advanced 3D applications and pipelines based on OpenUSD.

Implemented together, cuOpt, Omniverse and NVIDIA Metropolis for Factories can help optimize and create safe environments in logistics-heavy facilities that rely on complex automation, precise material flow and human-robot interaction, such as automotive factories, semiconductor fabs and warehouses.

cuOpt has been continuously tested against the best-known solutions on the most studied benchmarks for route optimization, with results up to 100x faster than CPU-based implementations. With 15 records from the Gehring & Homberger vehicle routing benchmark and eight from the Li & Lim pickup and delivery benchmark, cuOpt has demonstrated the world’s best accuracy with the fastest times.

AI promises to deliver logistics efficiencies spanning from transportation networks to manufacturing and much more.

Delivering Cost-Savings for Inspections With cuOpt

Kawasaki Heavy Industries is a manufacturing company that’s been building large machinery for more than a hundred years. The Japanese company partnered with Slalom and used cuOpt to create routing efficiencies for the development of its AI-driven Kawasaki Track Maintenance Platform.

Railroad track maintenance is getting an AI makeover worldwide. Traditionally, track inspections and maintenance are time-consuming and difficult to manage to keep trains running on time. But track maintenance is critical for safety and transportation service. Railway companies are automating track inspections with AI and machine learning paired with digital cameras, lasers and gyrometric sensors.

Kawasaki is harnessing the edge computing of NVIDIA Jetson AGX Orin to develop track inspections on its Track Maintenance Platform for running onboard trains. The platform enables customers to improve vision models with the data collected on tracks for advances in the inspection capability of the edge-based AI system.

The platform provides maintenance teams data on track conditions that allows them to prioritize repairs, creating increased safety and reliability of operations.

According to Kawasaki, it’s estimated that such an AI-driven system can save $218 million a year for seven companies from automating their track inspections.

Creating Manufacturing Efficiencies With cuOpt and Omniverse

A worldwide leader in automotive seating manufacturing has adopted SyncTwin’s digital twin capability, which is driven by Omniverse and cuOpt, to improve its operations with AI.

The global automotive seating manufacturer has a vast network of loading docks for the delivery of raw materials, and forklifts for unloading and transporting them to storage and handling areas to ensure a steady supply to production lines. SyncTwin’s connection to cuOpt delivers routing efficiencies that optimize all of these moving parts — from vehicles to robotic pallet jacks.

As the SyncTwin solution was developed on top of Omniverse and USD, manufacturers can ensure that their various factory planning tools can contribute to the creation of a rich digital twin environment. Plus, they eliminate tedious manual data collection and gain new insights from their previously disconnected data.

Attend GTC to explore how cuOpt is achieving world-record accuracy and performance to solve complex problems. Learn more about cuOpt world records in our tech blog. Learn more about Omniverse.

Read More

All Eyes on AI: Automotive Tech on Full Display at GTC 2024

All Eyes on AI: Automotive Tech on Full Display at GTC 2024

All eyes across the auto industry are on GTC — the global AI conference running in San Jose, Calif., and online through Thursday, March 21 — as the world’s top automakers and tech leaders converge to showcase the latest models, demo new technologies and dive into the remarkable innovations reshaping the sector.

Attendees will experience how generative AI and software-defined computing are advancing the automotive landscape and transforming the behind-the-wheel experience to become safer, smarter and more enjoyable.

Automakers Adopting NVIDIA DRIVE Thor

NVIDIA founder and CEO Jensen Huang kicked off GTC with a keynote address in which he revealed that NVIDIA DRIVE Thor, which combines advanced driver assistance technology and in-vehicle infotainment, now features the newly announced NVIDIA Blackwell GPU architecture for transformer and generative AI workloads.

Following the keynote, top EV makers shared how they will integrate DRIVE Thor into their vehicles. BYD, the world’s largest electric vehicle maker, is expanding its ongoing collaboration with NVIDIA and building its next-generation EV fleets on DRIVE Thor. Hyper, a premium luxury brand owned by GAC AION, is announcing it has selected DRIVE Thor for its new models, which will begin production in 2025. XPENG will use DRIVE Thor as the AI brain of its next-generation EV fleets. These EV makers join Li Auto and ZEEKR, which previously announced they’re building their future vehicle roadmaps on DRIVE Thor.

Additionally, trucking, robotaxis and goods delivery vehicle makers are announcing support for DRIVE Thor. Nuro is choosing DRIVE Thor to power the Nuro Driver. Plus is announcing that future generations of its level 4 solution, SuperDrive, will run on DRIVE Thor. Waabi is  leveraging DRIVE Thor to deliver the first generative AI-powered autonomous trucking solution to market. WeRide, in cooperation with tier 1 partner Lenovo Vehicle Computing, is creating level 4 autonomous driving solutions for commercial applications built on DRIVE Thor.

And, DeepRoute.ai is unveiling its new smart driving architecture powered by NVIDIA DRIVE Thor, scheduled to launch next year.

Next-Generation Tech on the Show Floor

The GTC exhibit hall is buzzing with excitement as companies showcase the newest vehicle models and offer technology demonstrations.

Attendees have the opportunity to see firsthand the latest NVIDIA-powered vehicles on display,  including Lucid Air, Mercedes-Benz Concept CLA Class, Nuro R3, Polestar 3, Volvo EX90, WeRide Robobus, and an Aurora truck. The Lucid Air is available for test drives during the week.

A wide array of companies are showcasing innovative automotive technology at GTC, including Foretellix, Luminar and MediaTek, which is launching its Dimensity Auto Cockpit chipsets at the show. The new solutions harness NVIDIA’s graphics and AI technologies to help deliver state-of-the-art in-vehicle user experiences, added safety and security capabilities.

Also Announced at GTC: Omniverse Cloud APIs, Generative AI

  • Omniverse Cloud APIs, announced today at NVIDIA GTC, are poised to accelerate the path to autonomy by enabling high-fidelity sensor simulation for AV development and validation. Developers and software vendors such as CARLA, MathWorks, MITRE, Foretellix and Voxel51 underscore the broad appeal of these APIs in autonomous vehicles.
  • Generative AI developers including Cerence, Geely, Li Auto, NIO, SoundHound, Tata Consulting Services and Wayve announced plans to transform the in-vehicle experience by using NVIDIA’s cloud-to-edge technology to help develop intelligent AI assistants, driver and passenger monitoring, scene understanding and more.

AI and Automotive Sessions Available Live and on Demand

Throughout the week, the world’s foremost experts on automotive technology will lead a broad array of sessions and panels at GTC, including:

On DRIVE Developer Day, taking place Thursday, March 21, NVIDIA’s engineering experts will highlight the latest DRIVE features and developments through a series of deep-dive sessions on how to build safe and robust self-driving systems.

See the full schedule of automotive programming at GTC and be sure to tune in.

Read More

Generative AI Developers Harness NVIDIA Technologies to Transform In-Vehicle Experiences

Generative AI Developers Harness NVIDIA Technologies to Transform In-Vehicle Experiences

Cars of the future will be more than just modes of transportation; they’ll be intelligent companions, seamlessly blending technology and comfort to enhance driving experiences, and built for safety, inside and out.

NVIDIA GTC, running this week at the San Jose Convention Center, will spotlight the groundbreaking work NVIDIA and its partners are doing to bring the transformative power of generative AI, large language models and visual language models to the mobility sector.

At its booth, NVIDIA will showcase how it’s building automotive assistants to enhance driver safety, security and comfort through enhanced perception, understanding and generative capabilities powered by deep learning and transformer models.

Talking the Talk

LLMs, a form of generative AI, largely represent a class of deep-learning architectures known as transformer models, which are neural networks adept at learning context and meaning.

Vision language models are another derivative of generative AI, that offer image processing and language understanding capabilities. Unlike traditional or multimodal LLMs that primarily process and generate text-based data, VLMs can analyze and generate text via images or videos.

And retrieval-augmented generations allows manufacturers to access knowledge from a specific database or the web to assist drivers.

These technologies together enable NVIDIA Avatar Cloud Engine, or ACE, and multimodal language models to work together with the NVIDIA DRIVE platform to let automotive manufacturers develop their own intelligent in-car assistants.

For example, an Avatar configurator can allow designers to build unique, brand-inspired personas for their cars, complete with customized voices and emotional attributes. These AI-animated avatars can engage in natural dialogue, providing real-time assistance, recommendations and personalized interactions.

Furthermore, AI-enhanced surround visualization enhances vehicle safety using 360-degree camera reconstruction, while the intelligent assistant sources external information, such as local driving laws, to inform decision-making.

Personalization is paramount, with AI assistants learning driver and passenger habits and adapting its behavior to suit occupants’ needs.

Generative AI for Automotive in Full Force at GTC 

Several NVIDIA partners at GTC are also showcasing their latest generative AI developments using NVIDIA’s edge-to-cloud technology:

  • Cerence’s CaLLM is an automotive-specific LLM that serves as the foundation for the company’s next-gen in-car computing platform, running on NVIDIA DRIVE. The platform, unveiled late last year, is the future of in-car interaction, with an automotive- and mobility-specific assistant that provides an integrated in-cabin experience. Cerence is collaborating with NVIDIA engineering teams for deeper integration of CaLLM with the NVIDIA AI Foundation Models. Through joint efforts, Cerence is harnessing NVIDIA DGX Cloud as the development platform, applying guardrails for enhanced performance, and leveraging NVIDIA AI Enterprise to optimize inference. NVIDIA and Cerence will continue to partner and pioneer this solution together with several automotive OEMs this year.
  • Wavye is helping usher in the new era of Embodied AI for autonomy, their next-generation AV2.0 approach is characterized by a large Embodied AI foundation model that learns to drive self-supervised using AI end-to-end —from sensing, as an input, to outputting driving actions. The British startup has already unveiled its GAIA-1, a generative world model for AV development running on NVIDIA; alongside LINGO-1, a closed-loop driving commentator that uses natural language to enhance the learning and explainability of AI driving models.
  • Li Auto unveiled its multimodal cognitive model, Mind GPT, in June. Built on NVIDIA TensorRT-LLM, an open-source library, it serves as the basis for the electric vehicle maker’s AI assistant, Lixiang Tongxue, for scene understanding, generation, knowledge retention and reasoning capabilities. Li Auto is currently developing DriveVLM to enhance autonomous driving capabilities, enabling the system to understand complex scenarios, particularly those that are challenging for traditional AV pipelines, such as unstructured roads, rare and unusual objects, and unexpected traffic events. This advanced model is trained on the NVIDIA GPUs and utilizes TensorRT-LLM and NVIDIA Triton Inference Server for data generation in the data center. With inference optimized by NVIDIA DRIVE and TensorRT-LLM, DriveVLMs perform efficiently on embedded systems.
  • NIO launched its NOMI GPT, which offers a number of functional experiences, including NOMI Encyclopedia Q&A, Cabin Atmosphere Master and Vehicle Assistant. With the capabilities enabled by LLMs and an efficient computing platform powered by NVIDIA AI stacks, NOMI GPT is capable of basic speech recognition and command execution functions and can use deep learning to understand and process more complex sentences and instructions inside the car.
  • Geely is working with NVIDIA to provide intelligent cabin experiences, along with accelerated edge-to-cloud deployment. Specifically, Geely is applying generative AI and LLM technology to provide smarter, personalized and safer driving experiences, using natural language processing, dialogue systems and predictive analytics for intelligent navigation and voice assistants. When deploying LLMs into production, Geely uses NVIDIA TensorRT-LLM to achieve highly efficient inference. For more complex tasks or scenarios requiring massive data support, Geely plans to deploy large-scale models in the cloud.
  • Waabi is building AI for self-driving and will use the generative AI capabilities afforded by NVIDIA DRIVE Thor for its breakthrough autonomous trucking solutions, bringing safe and reliable autonomy to the trucking industry.
  • Lenovo is unveiling a new AI acceleration engine, dubbed UltraBoost, which will run on NVIDIA DRIVE, and features an AI model engine and AI compiler tool chains to facilitate the deployment of LLMs within vehicles.
  • SoundHound AI is using NVIDIA to run its in-vehicle voice interface — which combines both real-time and generative AI capabilities — even when a vehicle has no cloud connectivity. This solution also offers drivers access to SoundHound’s Vehicle Intelligence product, which instantly delivers settings, troubleshooting and other information directly from the car manual and other data sources via natural speech, as opposed to through a physical document.
  • Tata Consulting Services (part of the TATA Group), through its AI-based technology and engineering innovation, has built its automotive GenAI suite powered by NVIDIA GPUs and software frameworks. It accelerates the design, development, and validation of software-defined vehicles, leveraging the various LLMs and VLMs for in-vehicle and cloud-based systems.
  • MediaTek is announcing four automotive systems-on-a-chip within its Dimensity Auto Cockpit portfolio, offering powerful AI-based in-cabin experiences for the next generation of intelligent vehicles that span from premium to entry level. To support deep learning capabilities, the Dimensity Auto Cockpit chipsets integrate NVIDIA’s next-gen GPU-accelerated AI computing and NVIDIA RTX-powered graphics to run LLMs in the car, allowing vehicles to support chatbots, rich content delivery to multiple displays, driver alertness detection and other AI-based safety and entertainment applications.

Check out the many automotive talks on generative AI and LLMs throughout the week of GTC.

Register today to attend GTC in person, or tune in virtually, to explore how generative AI is making transportation safer, smarter and more enjoyable.

Read More

NVIDIA Unveils Digital Blueprint for Building Next-Gen Data Centers

NVIDIA Unveils Digital Blueprint for Building Next-Gen Data Centers

Designing, simulating and bringing up modern data centers is incredibly complex, involving multiple considerations like performance, energy efficiency and scalability.

It also requires bringing together a team of highly skilled engineers across compute and network design, computer-aided design (CAD) modeling, and mechanical, electrical and thermal design.

NVIDIA builds the world’s most advanced AI supercomputers and at GTC unveiled its latest — a large cluster based on the NVIDIA GB200 NVL72 liquid-cooled system. It consists of two racks, each containing 18 NVIDIA Grace CPUs and 36 NVIDIA Blackwell GPUs, connected by fourth-generation NVIDIA NVLink switches.

On the show floor, NVIDIA demoed this fully operational data center as a digital twin in NVIDIA Omniverse, a platform for connecting and building generative AI-enabled 3D pipelines, tools, applications and services.

To bring up new data centers as fast as possible, NVIDIA first built its digital twin with software tools connected by Omniverse. Engineers unified and visualized multiple CAD datasets in full physical accuracy and photorealism in Universal Scene Description (OpenUSD) using the Cadence Reality digital twin platform, powered by NVIDIA Omniverse APIs.

Design, Simulate and Optimize With Enhanced Efficiency and Accuracy

The new GB200 cluster is replacing an existing cluster in one of NVIDIA’s legacy data centers. To start the digital build-out, technology company Kinetic Vision scanned the facility using the NavVis VLX wearable lidar scanner to produce highly accurate point cloud data and panorama photos.

Then, Prevu3D software was used to remove the existing clusters and convert the point cloud to a 3D mesh. This provided a physically accurate 3D model of the facility, in which the new digital data center could be simulated.

Engineers combined and visualized multiple CAD datasets with enhanced precision and realism by using the Cadence Reality platform. The platform’s integration with Omniverse provided a powerful computing platform that enabled teams to develop OpenUSD-based 3D tools, workflows and applications.

Omniverse Cloud APIs also added interoperability with more tools, including PATCH MANAGER and NVIDIA Air. With PATCH MANAGER, the team designed the physical layout of their cluster and networking infrastructure, ensuring that cabling lengths were accurate and routing was properly configured.

The team used Cadence’s Reality Digital Twin solvers, accelerated by NVIDIA Modulus APIs and NVIDIA Grace Hopper, to simulate airflows, as well as the performance of the new liquid-cooling systems from partners like Vertiv and Schneider Electric. The integrated cooling systems in the GB200 trays were simulated and optimized using solutions from Ansys, which brought simulation data into the digital twin.

The demo showed how digital twins can allow users to fully test, optimize and validate data center designs before ever producing a physical system. By visualizing the performance of the data center in the digital twin, teams can better optimize their designs and plan for what-if scenarios.

Users can also enhance data center and cluster designs by balancing disparate sets of boundary conditions, such as cabling lengths, power, cooling and space, in an integrated manner — enabling engineers and design teams to bring clusters online much faster and with more efficiency and optimization than before.

Learn more about NVIDIA Omniverse and NVIDIA Blackwell.

Read More

New NVIDIA Storage Partner Validation Program Streamlines Enterprise AI Deployments

New NVIDIA Storage Partner Validation Program Streamlines Enterprise AI Deployments

A sharp increase in generative AI deployments is driving business innovation for enterprises across industries. But it’s also posing significant challenges for their IT teams, as slowdowns from long and complex infrastructure deployment cycles prevent them from quickly spinning up AI workloads using their own data.

To help overcome these barriers, NVIDIA has introduced a storage partner validation program for NVIDIA OVX computing systems. The high-performance storage systems leading the way in completing the NVIDIA OVX storage validation are DDN, Dell PowerScale, NetApp, Pure Storage and WEKA.

NVIDIA OVX servers combine high-performance, GPU-accelerated compute with high-speed storage access and low-latency networking to address a range of complex AI and graphics-intensive workloads. Chatbots, summarization and search tools, for example, require large amounts of data, and high-performance storage is critical to maximize system throughput.

To help enterprises pair the right storage with NVIDIA-Certified OVX servers, the new program provides a standardized process for partners to validate their storage appliances. They can use the same framework and testing that’s needed to validate storage for the NVIDIA DGX BasePOD reference architecture.

To achieve validation, partners must complete a suite of NVIDIA tests measuring storage performance and input/out scaling across multiple parameters that represent the demanding requirements of various enterprise AI workloads. This includes combinations of different I/O sizes, varying numbers of threads, buffered I/O vs. direct I/O, random reads, re-reads and more.

Each test is run multiple times to verify the results and gather the required data, which is then audited by NVIDIA engineering teams to determine whether the storage system has passed.

The program offers prescriptive guidance to ensure optimal storage performance and scalability for enterprise AI workloads with NVIDIA OVX systems. But the overall design remains flexible, so customers can tailor their system and storage choices to fit their existing data center environments and bring accelerated computing to wherever their data resides.

Generative AI use cases have fundamentally different requirements than traditional enterprise applications, so IT teams must carefully consider their compute, networking, storage and software choices to ensure high performance and scalability.

NVIDIA-Certified Systems are tested and validated to provide enterprise-grade performance, manageability, security and scalability for AI workloads. Their flexible reference architectures help deliver faster, more efficient and more cost-effective deployments than independently building from the ground up.

Powered by NVIDIA L40S GPUs, OVX servers include NVIDIA AI Enterprise software with NVIDIA Quantum-2 InfiniBand or NVIDIA Spectrum-X Ethernet networking, as well as NVIDIA BlueField-3 DPUs. They’re optimized for generative AI workloads, including training for smaller LLMs (for example, Llama 2 7B or 70B), fine-tuning existing models and inference with high throughput and low latency.

NVIDIA-Certified OVX servers are now available and shipping from global system vendors, including GIGABYTE, Hewlett Packard Enterprise and Lenovo. Comprehensive, enterprise-grade support for these servers is provided by each system builder, in collaboration with NVIDIA.

Availability 

Validated storage solutions for NVIDIA-Certified OVX servers are now available, and reference architectures will be published over the coming weeks by each of the storage and system vendors. Learn more about NVIDIA OVX Systems.

Read More

From Atoms to Supercomputers: NVIDIA, Partners Scale Quantum Computing

From Atoms to Supercomputers: NVIDIA, Partners Scale Quantum Computing

The latest advances in quantum computing include investigating molecules, deploying giant supercomputers and building the quantum workforce with a new academic program.

Researchers in Canada and the U.S. used a large language model to simplify quantum simulations that help scientists explore molecules.

“This new quantum algorithm opens the avenue to a new way of combining quantum algorithms with machine learning,” said Alan Aspuru-Guzik, a professor of chemistry and computer science at the University of Toronto, who led the team.

The effort used CUDA-Q, a hybrid programming model for GPUs, CPUs and the QPUs quantum systems use. The team ran its research on Eos, NVIDIA’s H100 GPU supercomputer.

Software from the effort will be made available for researchers in fields like healthcare and chemistry. Aspuru-Guzik will detail the work in a talk at GTC.

Quantum Scales for Fraud Detection

At HSBC, one of the world’s largest banks, researchers designed a quantum machine learning application that can detect fraud in digital payments.

The bank’s quantum machine learning algorithm simulated a whopping 165 qubits on NVIDIA GPUs. Research papers typically don’t extend beyond 40 of these fundamental calculating units quantum systems use.

HSBC used machine learning techniques implemented with CUDA-Q and cuTensorNet software on NVIDIA GPUs to overcome challenges simulating quantum circuits at scale. Mekena Metcalf, a quantum computing research scientist at HSBC (pictured above), will present her work in a session at GTC.

Raising a Quantum Generation

In education, NVIDIA is working with nearly two dozen universities to prepare the next generation of computer scientists for the quantum era. The collaboration will design curricula and teaching materials around CUDA-Q.

“Bridging the divide between traditional computers and quantum systems is essential to the future of computing,” said Theresa Mayer, vice president for research at Carnegie Mellon University. “NVIDIA is partnering with institutions of higher education, Carnegie Mellon included, to help students and researchers navigate and excel in this emerging hybrid environment.”

To help working developers get hands-on with the latest tools, NVIDIA co-sponsored QHack, a quantum hackathon in February. The winning project, developed by Gopesh Dahale of Qkrishi — a quantum company in Gurgaon, India — used CUDA-Q to develop an algorithm to simulate a material critical in designing better batteries.

A Trio of New Systems

Two new systems being deployed further expand the ecosystem for hybrid quantum-classical computing.

The largest of the two, ABCI-Q at Japan’s National Institute of Advanced Industrial Science and Technology, will be one of the largest supercomputers dedicated to research in quantum computing. It will use CUDA-Q on NVIDIA H100 GPUs to advance the nation’s efforts in the field.

In Denmark, the Novo Nordisk Foundation will deploy an NVIDIA DGX SuperPOD, half of which will be dedicated to research in quantum computing as part of the country’s national plan to advance the technology.

The new systems join Australia’s Pawsey Supercomputing Research Centre, which announced in February it will run CUDA-Q on NVIDIA Grace Hopper Superchips at its National Supercomputing and Quantum Computing Innovation Hub.

Partners Drive CUDA Quantum Forward

In other news, Israeli startup Classiq released at GTC a new integration with CUDA-Q. Classiq’s quantum circuit synthesis lets high-level functional models automatically generate optimized quantum programs, so researchers can get the most out of today’s quantum hardware and expand the scale of their work on future algorithms.

Software and service provider QC Ware is integrating its Promethium quantum chemistry package with the just-announced NVIDIA Quantum Cloud.

ORCA Computing, a quantum systems developer headquartered in London, released results running quantum machine learning on its photonics processor with CUDA-Q. In addition, ORCA was selected to build and supply a quantum computing testbed for the UK’s National Quantum Computing Centre which will include an NVIDIA GPU cluster using CUDA-Q.

Nvidia and Infleqtion, a quantum technology leader, partnered to bring cutting-edge quantum-enabled solutions to Europe’s largest cyber-defense exercise with NVIDIA-enabled Superstaq software.

A cloud-based platform for quantum computing, qBraid, is integrating CUDA-Q into its developer environment. And California-based BlueQubit described in a blog how NVIDIA’s quantum technology, used in its research and GPU service, provides the fastest and largest quantum emulations possible on GPUs.

Get the Big Picture at GTC

To learn more, watch a session about how NVIDIA is advancing quantum computing and attend an expert panel on the topic, both at NVIDIA GTC, a global AI conference, running March 18-21 at the San Jose Convention Center.

And get the full view from NVIDIA founder and CEO Jensen Huang in his GTC keynote.

Read More

NVIDIA Maxine Developer Platform to Transform $10 Billion Video Conferencing Industry

NVIDIA Maxine Developer Platform to Transform $10 Billion Video Conferencing Industry

Video conferencing has allowed many to be productive from anywhere.

Now NVIDIA is boosting the productivity of the developers of video conferencing, call center and streaming applications within the $10 billion industry by allowing them to easily integrate AI into their workflows.

The new release of the Maxine AI Developer Platform transforms the creation of state-of-the-art, real-time video conferencing applications with features enabling enhanced user flexibility, engagement and efficiency.

Available through the NVIDIA AI Enterprise software platform, Maxine allows developers to tap into the latest AI-driven features — such as enhanced video and audio quality and augmented reality effects — to turn users’ everyday video calls into engaging, collaborative experiences.

Expanding Video Conferencing With New Maxine Features 

The Maxine AI Developer Platform enables developers to easily access and integrate real-time, AI-enhanced features that increase the quality of engagement for video conferencing users.

Features like noise reduction, video denoising and upscaling, and studio voice improve the quality of audio and video streams. With advanced capabilities like eye-gaze correction, live portrait and future features such as video relighting and cloud microservice Maxine 3D, developers can enhance video conferencing engagement and personal connection.

The platform extends the utility of the state-of-the-art AI models for audio, video and augmented reality effects with multiple ways for developers to deliver Maxine features with offerings of software development kits, microservices, and even application programming interface (API) endpoints delivered from NVIDIA’s cloud infrastructure.

Maxine production feature updates available now include:

  • Eye Contact: The improved eye contact model provides gaze redirection with natural eye movements for deeper meeting participant engagement.
  • Voice Font: This new model matches the speaker’s voice to a target voice while keeping linguistic information and prosody (rhythm and tone) unchanged.
  • Background Noise Reduction (BNR) 2.0: This model updates noise reduction for human listening and for language encoding with a specific effort to decrease encoding word error rates.

New features available for early access this spring include:

  • Speech Live Portrait: This model allows a user to drive their portrait with direct speech or any audio source, allowing users to always look their best during a conference call.
  • Studio Voice: This model enables ordinary headset, laptop and desktop microphones to deliver the sound of a high-end studio mic, allowing users to always sound their best during a conference call.

The Maxine early access program shares preproduction and prerelease builds of upcoming features in order to get feedback from developers on the utility and refinement of Maxine models. In this release we are asking developers for feedback on features early in the development pipeline including:

  • Maxine 3D: Previously shown as a research demonstration at SIGGRAPH 2023, this cloud microservice offers a new level of engagement for video conferencing with real-time NeRF technology lifting 2D video to 3D.
  • Video Relighting: This new model uses a high-dynamic-range image to light the user, enabling seamless matching of user lighting with various background images.
  • API Endpoints: API Endpoints offer developers the flexibility of accessing Maxine features through NVIDIA cloud infrastructure, making Maxine integration even easier.

Jugo and Arsenal Football Club Score Major Goals 

Sporting events are the ultimate human experience, uniting teams and fans beyond borders and language barriers. Jugo, using Maxine’s AI Green Screen feature, offers a digital platform for virtual events that enables companies to create immersive experiences with Unreal Engine that bring fans together from all over the world without the use of a full production studio.

Arsenal FC, a powerhouse franchise in England’s Premier League, is collaborating with Jugo to revolutionize the way the soccer club engages with its 600 million global fan base. The collaboration offers new virtual sports entertainment experiences to boost engagement for global supporters. Jugo brings the power of real, human interaction into Arsenal events, creating realistic virtual connections between supporters and the club’s sports heroes.

“The Jugo Experience platform is transforming the market for brands in their pursuit of global awareness and engagement,” said Richard Stirk, CEO of Jugo Experience. “Arsenal F.C. is the perfect example of a global brand extension. The flexibility in creating an immersive brand experience is a key to Jugo’s offering and the Maxine AI Developer Platform is a basic building block of this flexibility.”

Setting a New Standard of AI-Enhanced Video Conferencing 

Among the first customers to tap into the newest set of features within the early access program to create a professional audio-visual studio from commodity cameras and microphones are Gemelo, Pexip, Spectacle and VideoRequest.

“Gemelo has been involved in testing prerelease builds of Maxine models for a number of years now, and we value the chance to give early input on Maxine features as they’re developed,” said Paul Jaski, CEO of Gemelo. “The latest feature, Speech Live Portrait, will provide our customers with greater flexibility in creating customized video messaging, opening the doors to a new era of personalization.”

“Pexip welcomes the chance to test development versions of Maxine features and help guide the final product models,” said Ian Mortimer, chief technology officer at Pexip. “In testing the newest version of Maxine BNR, we are seeing significant improvements in intelligibility and speech quality and plan to continue refining our testing parameters to help optimize for accuracy in AI translation pipelines.”

“The NVIDIA Maxine Eye Contact API significantly simplified our path to providing engaging video processing capabilities to the users of our Spectacle app, eliminating the need to worry about infrastructure and resource-intensive integrations,” said Benjamin Portman, president of Spectacle. “With it, we were able to create a proof of concept within a matter of days, speeding up our production application deployment timeline.”

“Our early testing of Maxine Studio Voice enabled an impressive look into what is now possible with AI-enhanced production and video testimonials,” said Joe Tyler, chief technology officer at VideoRequest. “The new Maxine BNR and Eye Contact features will help elevate the quality of our customer’s videos by overcoming their challenging recording environments.”

Availability 

Learn more about NVIDIA Maxine, which is available now on NVIDIA AI Enterprise.

See notice regarding software product information.

Read More

NVIDIA Edify Unlocks 3D Generative AI, New Image Controls for Visual Content Providers

NVIDIA Edify Unlocks 3D Generative AI, New Image Controls for Visual Content Providers

NVIDIA Edify, a multimodal architecture for visual generative AI, is entering a new dimension.

3D asset generation is among the latest capabilities Edify offers developers and visual content providers, who will also be able to exert more creative control over AI image generation.

Multimedia content and data provider Shutterstock is rolling out early access to an API, or application programming interface, built on the Edify architecture that lets creators use text prompts or images to rapidly generate 3D objects for virtual scenes.

Visual content creator and marketplace Getty Images will add custom fine-tuning capabilities to its commercially safe Generative AI service, helping enterprise customers generate visuals that adhere to brand guidelines and styles. The service will also incorporate new features to offer customers even further control of their generated images.

Developers can test drive pretrained Edify models by Getty Images and Shutterstock as APIs through NVIDIA NIM, a collection of microservices for inference announced today at NVIDIA GTC. Developers can also train and deploy their own generative AI models using the Edify architecture through NVIDIA Picasso, an AI foundry built on NVIDIA DGX Cloud.

NVIDIA and Adobe are collaborating to bring new 3D generative AI technologies built on Edify to millions of Firefly and Creative Cloud creators.

Livestreaming platform Be.Live is using the NVIDIA Picasso foundry service to provide real-time generative AI that enables the automated creation of visuals and an interactive experience for audiences. Bria, a holistic platform tailored for businesses developing responsible visual generative AI, has adopted Picasso to run inference. And creative studio Cuebric is enhancing filmmaking and content creation by developing Picasso-powered generative AI applications to build immersive virtual environments.

Speedy 3D: Shutterstock 3D AI Generator Now in Early Access

Shutterstock’s 3D AI Services, available in early access, will enable creators to generate virtual objects for set dressing and ideation. This capability can drastically reduce the time needed to prototype a scene, giving artists more time to focus on hero characters and objects.

Shutterstock 3D generator in action. Video courtesy of Shutterstock.

Using the tools, creative professionals will be able to rapidly create assets from text prompts or reference images and choose from a selection of popular 3D formats to export their files. The Edify 3D-based service will also come with built-in safeguards to filter generated content.

The commercially safe model was trained on Shutterstock’s licensed data. Shutterstock has compensated hundreds of thousands of artists, with anticipated payments to millions more, for the role their content IP has played in training generative technology.

3d generated rainforest flora and fauna
Assets created using Shutterstock 3D AI generator, rendered and arranged as a flat-lay image. Image courtesy of Shutterstock.

At GTC, HP and Shutterstock are showcasing a collaboration to enhance custom 3D printing using Edify 3D, providing designers with limitless prototype options.

Shutterstock’s 3D AI generator enables designers to rapidly iterate on concepts, creating digital assets that HP can convert to 3D printable models through automated workflows. HP 3D printers will then turn these models into physical prototypes to help inspire product designs.

Mattel is enabling 3D generative AI capabilities from Shutterstock that can accelerate the design ideation process. With AI tools, toy designers can visualize their ideas for new products with simple text descriptions. By lowering the technical barrier to creating high-fidelity concept design, designers can explore a broader pool of their ideas and iterate faster.

Shutterstock is also building Edify-based tools to light 3D scenes using 360 HDRi environments generated from text or image prompts.

Dassault Systèmes, through its leading 3DEXCITE applications for 3D content creation, and CGI studio Katana are incorporating Shutterstock generative 360 HDRi APIs into their workflows based on NVIDIA Omniverse, a computing platform for developing Universal Scene Description (OpenUSD)-based 3D workflows and applications.

Accenture Song, the world’s largest tech-powered creative group, is using the Omniverse platform to generate high-fidelity Defender vehicles from computer-aided design data for marketing purposes. Coupled with generative AI microservices powered by Edify, Accenture Song is enabling the creation of cinematic, interactive 3D environments via conversational prompts. The result is a fully immersive 3D scene that harmonizes realistic generated environments with a digital twin of the Defender vehicle.

Take Control: Turn Creative Vision Into Reality With Custom Fine-Tuning From Getty Images

Getty Images continues to expand the capabilities offered through its commercially safe generative AI service, which provides users indemnification for the content they generate.

At January’s CES show, Getty Images released Edify-powered APIs for inpainting, to add, remove or replace objects in an image, and outpainting, to expand the creative canvas. Those features are now available on both Gettyimages.com and iStock.com.

Starting in May, Getty Images will also offer services to custom fine-tune the Edify foundation model to a company’s brand and visual style.

The services will feature a no-code, self-service method for companies to upload a proprietary dataset, review automatically generated tags, submit fine-tuning tasks and review the results before deploying to production.

As part of custom fine-tuning tools, Getty Images will release a collection of APIs that provide finer control over image output, one of the biggest challenges in generative AI.

Developers will soon be able to access Sketch, Depth and Segmentation features — which allow users to provide a sketch to guide the AI’s image generation; copy the composition of a reference image via depth map; and segment parts of an image to add, remove or retouch a character or object.

Getty Images’ API services are already being used by leading creatives and advertisers, including:

  • Dentsu Inc.: The Japan-based ad agency is using Getty Images’ generative AI API service to power MAFA: Manga Anime For All, an app that can create manga and anime-style content for marketing use cases. Dentsu Creative is also using NVIDIA Picasso to fine-tune Getty Images’ model for leading membership warehouse retailer Sam’s Club.
  • McCann: The creative agency harnessed generative AI to help develop an innovative game for its client Reckitt’s over-the-counter cold medicine Mucinex, in which customers can interact with the brand’s mascot.
  • Refik Anadol Studio: Known for using generative AI in its artwork, the studio will be showcasing a rainforest-inspired art installation at GTC, created using Getty Images’ AI model fine-tuned with Refik’s rainforest catalog.
  • WPP: The marketing and communications services company is partnering with The Coca-Cola Company to explore how fine-tuning Getty Images’ model can help to build custom visuals that meet brand styles and guidelines.
rainforest-themed AI art
Large Nature Model: Living Archive installation at GTC 2024 by Refik Anadol Studios

Learn more about NVIDIA Picasso and try Edify-powered NIMs from Getty Images and Shutterstock at ai.nvidia.com.

Discover the latest in generative AI at NVIDIA GTC, a global AI developer conference, running in San Jose, Calif., and online through Thursday, March 21. 

Watch the GTC keynote address by NVIDIA founder and CEO Jensen Huang below:

Collage at top shows assets created by Edify-powered Shutterstock 3D AI generator on left, courtesy of Shutterstock. Images on right show Edify sketch-to-image capabilities, demonstrated by NVIDIA.

Read More

Make It So: Software Speeds Journey to Post-Quantum Cryptography

Make It So: Software Speeds Journey to Post-Quantum Cryptography

The journey to the future of secure communications is about to jump to warp drive.

NVIDIA cuPQC brings accelerated computing to developers working on cryptography for the age of quantum computing. The cuPQC library harnesses the parallelism of GPUs for their most demanding security algorithms.

Refactoring Security for the Quantum Era  

Researchers have known for years that quantum computers will be able to break the public keys used today to secure communications. As these systems approach readiness, government and industry initiatives have been ramping up to address this vital issue.

The U.S. National Institute of Standards and Technology, for example, is expected to introduce the first standard algorithms for post-quantum cryptography as early as this year.

Cryptographers working on advanced algorithms to replace today’s public keys need powerful systems to design and test their work.

Hopper Delivers up to 500x Speedups With cuPQC

In its first benchmarks, cuPQC accelerated Kyber — an algorithm proposed as a standard for securing quantum-resistant keys — by up to 500x running on an NVIDIA H100 Tensor Core GPU compared with a CPU.

The speedups will be even greater with NVIDIA Blackwell architecture GPUs, given Blackwell’s enhancements for the integer math used in cryptography and other high performance computing workloads.

“Securing data against quantum threats is a critically important problem, and we’re excited to work with NVIDIA to optimize post-quantum cryptography,” said Douglas Stebila, co-founder of the Open Quantum Safe project, a group spearheading work in the emerging field.

Accelerating Community Efforts

The project is a part of the newly formed Post-Quantum Cryptography Alliance, hosted by the Linux Foundation.

The alliance funds open source projects to develop post-quantum libraries and applications. NVIDIA is a member of the alliance with seats on both its steering and technical committees.

NVIDIA is also collaborating with cloud service providers such as Amazon Web Services (AWS), Google Cloud and Microsoft Azure on testing cuPQC.

In addition, leading companies in post-quantum cryptography such as EvolutionQ, PQShield, QuSecure and SandboxAQ are collaborating with NVIDIA, many with plans to integrate cuPQC into their offerings.

“Different use cases will require a range of approaches for optimal acceleration,” said Ben Packman, a senior vice president at PQShield. “We are delighted to explore cuPQC with NVIDIA.”

Learn More at GTC

Developers working on post-quantum cryptography can sign up for updates on cuPQC here.

To learn more, watch a session about how NVIDIA is advancing quantum computing and attend an expert panel on the topic at NVIDIA GTC, a global AI conference, running through March 21 at the San Jose Convention Center and online.

Get the full view from NVIDIA founder and CEO Jensen Huang in his GTC keynote.

Read More

NVIDIA Brings Generative AI for Digital Humans, New RTX Technologies and More DLSS 3.5 Games to GDC

NVIDIA Brings Generative AI for Digital Humans, New RTX Technologies and More DLSS 3.5 Games to GDC

Generative AI is capable of creating more realistic verbal and facial expressions for digital humans than ever before.

This week at GDC 2024, NVIDIA announced that leading AI application developers across a wide range of industries are using NVIDIA digital human technologies to create lifelike avatars for commercial applications and dynamic game characters. NVIDIA enables developers with state-of-the-art digital human technologies, including NVIDIA ACE for speech and animation, NVIDIA NeMo for language, and NVIDIA RTX for ray-traced rendering.

Developers showcased new digital human technology demos that used NVIDIA ACE microservices at GDC.

Embracing ACE: Partners Transforming Pixels Into Personalities  

Top game and digital human developers are pioneering ways ACE and generative AI technologies can be used to transform interactions between players and NPCs in games and applications.

Developers embracing ACE include: Convai, Cyber Agent, Data Monsters, Deloitte, HippocraticAI, IGOODI, Inworld AI, Media.Monks, miHoYo, NetEase Games, Perfect World Games, Openstream, OurPalm, Quantiphi, Rakuten Securities, Slalom, SoftServe, Tencent, Ubisoft, UneeQ and Unions Avatars.

Demos Showcase New NVIDIA Digital Human Technologies

NVIDIA worked with developers Inworld AI and UneeQ on a series of new demos to display the potential of digital human technologies.

Inworld AI created Covert Protocol in partnership with NVIDIA, allowing players to become a skilled private detective, pushing the possibilities of non-playable character interactions. The demo taps into NVIDIA Riva automatic speech recognition (ASR) and NVIDIA Audio2Face microservices alongside the Inworld Engine.

The Inworld Engine brings together cognition, perception and behavior systems to create an immersive narrative along with the beautifully crafted RTX-rendered environments and art.

UneeQ is a digital human platform specialized in creating high-fidelity AI-powered 3D avatars for a range of enterprise applications. UneeQ’s digital humans power interactive experiences for brands enabling them to communicate with customers in real-time to give them confidence in their purchases. UneeQ integrated NVIDIA Audio2Face microservice into its platform and combined it with Synanim ML to create highly realistic avatars for a better customer experience and engagement.

New NVIDIA RTX Technologies for Dynamic Scenes

NVIDIA RTX revolutionized gaming several years ago by offering a collection of rendering technologies that enable real-time path tracing in games and applications.

The latest addition, Neural Radiance Cache (NRC), is an AI-driven RTX algorithm to handle indirect lighting in fully dynamic scenes, without the need to bake static lighting for geometry and materials beforehand.

Adding flexibility for developers, NVIDIA is introducing Spatial Hash Radiance Cache (SHaRC), which offers similar benefits as NRC but without using a neural network, and with compatibility on any DirectX or Vulkan ray tracing-capable GPU.

RTX. It’s On: More RTX and DLSS 3.5 Titles 

There are now over 500 RTX games and applications that have revolutionized the ways people play and create with ray tracing, NVIDIA DLSS and AI-powered technologies.  And gamers have a lot to look forward to with more full ray tracing and DLSS 3.5 titles coming.

Our latest innovation, NVIDIA DLSS 3.5, features new DLSS Ray Reconstruction technology. When activated, DLSS Ray Reconstruction replaces hand-tuned ray tracing denoisers with a new unified AI model that enhances ray tracing in supported games, elevating image quality to new heights.

Full Ray Tracing and DLSS 3.5 are coming to both Black Myth: Wukong and NARAKA: BLADEPOINT. And Portal with RTX is available now with RTX DLSS 3.5, enhancing its already beautiful full ray tracing. DLSS 3.5 With Ray Reconstruction will also be coming soon to the NVIDIA RTX Remix Open Beta, enabling modders to add RTX technologies like full ray tracing and DLSS 3.5 into classic games.

Star Wars Outlaws will launch with DLSS 3 and ray-traced effects. Ray tracing joins DLSS 3 in Diablo IV March 26. The First Berserker: Khazan will launch with DLSS 3. And Sengoku Destiny introduced support for DLSS 3 and is available now.

See our Partner Ecosystem at GDC

NVIDIA and our partners will showcase the latest in digital human technologies throughout the week of GDC. Here’s a quick snapshot:

  • Inworld AI (Booth P1615): Attendees will get the chance to try out Covert Protocol for themselves live at GDC.
  • Oracle Cloud Infrastructure (Booth S941): See Covert Protocol in action, and discover the “code assist” ability of Retrieval Augmented Generation (RAG). Register and join an exclusive networking event with Oracle and NVIDIA AI experts on March 21, open to women and allies in the gaming industry to build connections with leading voices in AI.
  • Dell Technologies and International Game Developer Association (Booth S1341): Playtest while building a game-ready asset on a workstation with large GPU memory. Speak to Sophie, an AI-powered assistant created by UneeQ and powered by NVIDIA ACE. Attendees can see the latest debugging and profiling tools for making ray-traced games​ in the latest Nsight Graphics ray tracing demo.
  • AWS: Developers can register and join NVIDIA, AWS, game studios, and technology partners as they discuss the game tech used to build, innovate, and maximize growth of today’s games at the AWS for Games Partner Showcase on March 20th.

Stop by these key sessions:

  • Transforming Gameplay with AI NPCs: This session featuring Nathan Yu, director of product at Inworld AI, Rajiv Gandhi, master principal cloud architect at Oracle Cloud and Yasmina Benkhoui, generative AI strategic partnerships lead at NVIDIA, will showcase successful examples of developers using AI NPCs to drive core game loops and mechanics that keep players engaged and immersed. Attendees will gain a deeper understanding of the potential of AI NPCs to create new and immersive experiences for players.
  • Alan Wake 2: A Deep Dive into Path Tracing Technology: This session will take a deep dive into the path tracing and NVIDIA DLSS Ray Reconstruction technologies implemented in Remedy Entertainment’s Alan Wake 2. Developers can discover how these cutting-edge techniques can enhance the visual experience of games.

Download our show guide to keep this summary on hand while at the show.

Get Started

Developers can start their journey on NVIDIA ACE by applying for our early access program to get in-development AI models.

If you want to explore available models, evaluate and access NVIDIA NIM, a set of easy-to-use microservices designed to accelerate the deployment of generative AI, for RIVA ASR and Audio2Face on ai.nvidia.com today.

RTXGI’s NRC and SHaRC algorithms are also available now as an experimental branch.

Read More