Home Blog Page 164

Visual Search Is One of the Biggest Trends – Know its Importance

importance of visual search
Image Credit: Canva

Everything is temporary except evolution. However, in the last few decades, the world has evolved at an unprecedented pace. There was a time when only written text was used for communication and information. However, pictures are now replacing words as people use them to communicate. It forced data scientists and IT experts to modify technologies according to the images’ popularity and users’ needs.

One of the most valuable techniques developed was the visual search method. Although this method was introduced a few years ago, it gained popularity quite late. However, it has become a trend these days to search for information. 

Most people rely on the picture search method to find images on the web as, in most cases, it is more accurate and efficient than word-based searches. The primary reason behind that is that it doesn’t show the results based on the SEO algorithms but the relevancy.

This article is written to help those still using old methods of digging for information. It can help internet users know the importance of this extraordinary technology and why they should prefer it over text inputs.

The image search method gained massive popularity because of the incredible benefits that it provides to people. As a result, reverse picture search has become an essential tool to use in different circumstances. The factors mentioned below can help you know why it is vital these days.

Provide Better Results

As mentioned above, text-based queries sometimes don’t help people find out what they are looking for. So, people must try multiple questions to find the most accurate and relevant data. On the contrary, the reverse photo lookup technique is comparatively much better than text inputs.

Most image search engines use machine learning and artificial intelligence to analyze the visual inputs, providing the exact results people need. These technologies make the search results far more relevant than other inputs. Due to its precision, many people don’t prefer text and audio queries until they are bound to do it.

Find out Who Are Using Your Pictures

Besides providing many other benefits, the reverse image lookup can also help people know who is illegally or unethically using their images. Once you upload a picture on a social media platform or web, you don’t have any control over it. Anyone can save it on their devices and use it for their benefit.

Keeping a check on your pictures is essential for you. There are some chances that scammers can use them to harm your reputation or for some propaganda purposes. Visual search methods can help you find images on the web that are uploaded anywhere else. Besides, you can also learn for what purpose these pictures are used. Once you know about them, you can secure your reputation before damaging it or take legal action against those using your pictorial data without your consent.

Save People from Scams

Scammers have found many ways to loot or trap people in this modern age. Catfishing is the most common among them. In this scam, fraudsters use other people’s pictorial data to create social media profiles, contact their friends and family members and ask for money. Every year, tons of people lose their money due to this new type of fraud. Unfortunately, it is hard to determine whether a catfisher or the real person is talking to you from the other end. 

Similarly, some eCommerce stores also use images of other companies’ products and sell low-quality and fake products. People should get assistance from photo search engines to avoid this issue. These engines will show the web pages and social media profiles where the same images have been uploaded. By analyzing those images, and their upload date, you can quickly know whether a fraudster is targeting you. 

Acquire Information about Different Objects

Sometimes we receive different images on WhatsApp or other platforms with unknown objects. Those objects could be anything like places, people, animals, birds, herbs, etc. Getting information about those images could be tricky using the text as an input query. That’s where image search comes in handy.

With the help of a visual search, you can quickly find out every piece of information regarding those unknown objects. This method allows people to discover other similar objects as well.

Detect Fake News

Social media and other sources of information are full of fake news. The propagandists frequently use fake news to spread misinformation for multiple purposes. For example, they can use it to malign their competitors, gain sympathies, or lead people to make unlawful decisions. Avoiding fake news is essential for people, but many don’t know how to differentiate between fake and real news. However, reverse image search engines and techniques can assist people in this case.

To know the authenticity of the news, people should save the supported pictures and search for them on the photo search facility. It will show the web pages having the same picture. By checking those sources, you can identify whether the real source of news is credible or not.  

Locate Royalty-Free Images

Images are sometimes essential for social media posts, blogs, and eCommerce stores. However, finding or creating relevant pictures is difficult for many people as copyright laws protect some pictures. Therefore, if you use them, you may have to face severe fines. That’s why avoiding them and finding royalty-free images is the only option you may have.

The reverse image search can help people find royalty-free images they can use on social media and blogging websites to support their arguments or any other purpose. In that case, a single image input is better than multiple text queries. 

To Conclude

Due to a massive increase in the use of images over the web, the need for visual search methods has increased. Analyzing the needs, many reverse image search engines are developed that now provide users with a range of benefits.

Advertisement

Salesforce Announces Genie, a Real-time Data Integration Platform

salesforce announces genie

Salesforce, a global cloud solutions provider, has announced Genie at the Dreamforce customer conference, a real-time data integration platform using which enterprises can deliver seamless services across sales, marketing, and commerce. 

David Schmaier, Chief Product Officer and President at Salesforce, said that the company had built Genie to automate every service provision by Customer 360, Salesforce’s customer relationship management (CRM) platform. Salesforce Genie forms the core of real-time Customer 360, collects, stores, and integrates real-time data streams with Salesforce transactional data.

Genie underlines the Salesforce platform by smoothing data movement whenever required. Patrick Stokes, GM and EVP of Salesforce, said, “So we’re announcing that our Customer 360 applications now have access to an entirely new way of bringing data into Salesforce in real time at scale that we’ve never been able to achieve before.”

Read More: NVIDIA announces Omniverse Cloud for metaverse at GTC 2022

Stokes highlighted that Genie is a lakehouse architecture and a modern equivalent of the company’s previous attempts to integrate transactional data in the CRM database. However, Genie is more than just an integration layer added to the platform. 

Genie offers Sales Cloud, Service Cloud, Marketing Cloud, and Commerce Cloud services separately. It also features services for Tableau, MuleSoft, and Slack. A part of its ability to offer such capabilities is developed on Salesforce’s cloud infrastructure, Hyperforce, which offers data security, privacy, and regulatory compliance controls. This ensures customers’ trust and reliance on the platform. 
You can check the entire list of services here.

Advertisement

NVIDIA Releases Maxine to Deliver Breakthrough Audio and Video Quality at Scale

nvidia releases maxine to deliver audio video quality

NVIDIA releases Maxine, a suite of GPU-driven software development kits (SDKs) to deliver breakthrough audio and video quality. Maxine enables clear communications via its cloud-native microservices for augmented-reality effects and audio-video enhancement. 

With the early-access release of Maxine’s audio effects, the company said that Maxine would be re-architected for cloud-native microservices. Additionally, new SDK capabilities, including Speaker Focus and Face Expression Estimation, were announced, along with the availability of Eye Contact to all users. Updated versions of existing SDK functionalities are also included in NVIDIA Maxine.

Maxine provides three updated GPU-accelerated SDKs for audio, video, and AR effects that revolutionize real-time communications with AI. A new feature called Speaker Focus isolates the audio tracks of foreground and background speakers to make each voice more audible. Lastly, the Audio Super Resolution SDK function has also received an upgrade with better quality.

Read More: New NVIDIA DGX System Software and Infrastructure Solutions Supercharge Enterprise AI

The video effects SDK uses a regular webcam to produce AI-based video effects. Enhancements to temporal stability have been made to the Virtual Background function, which divides a person’s profile into sections and uses AI-powered background removal, replacement, or blur.

Additionally, the AR SDK offers typical web camera feed-based, real-time 3D face tracking and body pose estimation driven by AI.

Other cloud-native microservices offered by Maxine will enable developers to create real-time AI applications. These services may be autonomously managed and deployed on the cloud, speeding up implementation time. Some of these microservices are:

  • Background Noise Removal
  • Room Echo Removal
  • Audio Super Resolution
  • Acoustic Echo Cancellation

Maxine is a part of the NVIDIA Omniverse Avatar Cloud Engine, a set of cloud-based AI models and services that developers may use to create, personalize, and use interactive avatars. You can refer to the GTC keynote for more information. 

Advertisement

New NVIDIA DGX System Software and Infrastructure Solutions Supercharge Enterprise AI

new nvidia dgx system software

During the GTC event, NVIDIA announced its new DGX system software and infrastructure to power innovation in enterprise AI development. The company announced that NVIDIA DGX H100 systems are now available for order. Based on the latest GPU chips, these systems will form the building blocks for NVIDIA’s full-stack AI solutions. 

The company launched the new NVIDIA Base Command software to simplify and accelerate AI developments by powering the DGX systems. The software will enable enterprises to tap the potential of their investment in NVIDIA’s DGX systems for orchestration and network infrastructure.

NVIDIA unveiled the DGX BasePOD to make AI deployments simpler and faster. The BasePOD provides an architectural framework for all DGX computing, storage, network, and software systems. 

Read More: Harvard and Stanford developed self-supervised AI to detect disease using NLP-based reports

The company has also created an advanced version of the BasePOD, the NVIDIA DGX SuperPOD. The DGX SuperPOD is a comprehensive hardware, software, and services package that removes the guesswork from developing and deploying AI infrastructure in any enterprise, making it the fastest route to AI innovation.

The GTC event also unveiled the NVIDIA Partner Network, a network of fully integrated and readily deployable offerings provided to valued partners. The program is intended for business models, including value-added reselling, solutions integration, system design or manufacture, hosting services, consultancy, or NVIDIA products and solutions.

Advertisement

NVIDIA Ramps up the Hopper Architecture and Pushes H100 Chips to Production

nvidia ramps up hopper and h100 chips production

NVIDIA is driving more and more architectural decisions and modifications in its CPU and GPU accelerator engines with each new generation. Jensen Huang, CEO of NVIDIA, announced that the company would ramp up Hopper, an architecture supporting AI workloads. The Hopper architecture is intended to scale diverse workloads for data centers. 

NVIDIA unveiled Hopper in March, along with other advancements like the NVIDIA Grace CPU. This month, the company released benchmark results for the chip in the MLPerf suite of machine learning tasks.

Hopper is built with approximately 80 billion transistors with NVIDIA’s cutting-edge TSMC 4N technology and features multiple innovations to enhance the performance of NVIDIA H100 Tensor Core GPUs. 

Read More: NeMo LLM Service: NVIDIA’s cloud service to make AI less complicated

The company has pushed the H100 Tensor Core GPUs to enter the production zone in total volume. The GPU chips will be shipped to companies including Hewlett Packard, Dell, Cisco Systems, etc. NVIDIA systems with the H100 GPU will enter the market in the first quarter of next year. 

When the company launched the first H100 GPU chip, Huang said, the chips would be “the next generation of accelerated computing.” The H100 chip is designed to accomplish artificial intelligence tasks for data centers. The company claims that H100 chips “dramatically” reduce deployment costs for AI-based programs. For instance, the performance of 320 top-of-the-line A100 GPUs is equivalent to only 64 H100s. 

Advertisement

Nvidia announces Omniverse Cloud for metaverse at GTC 2022

Nvidia announces Omniverse Cloud

Nvidia has announced Nvidia Omniverse Cloud, its first software and infrastructure-as-a-service offering, at Nvidia GTC 2022. It is a suite of cloud services for artists, developers, and enterprise teams to design, publish, operate, and experience metaverse applications anywhere. 

The technology uses the cloud to tap the heavy-duty power of data centers to enable Omniverse tools wherever the users happen to be. More than 700 companies and 200,000 people are using Omniverse now.

Using Omniverse Cloud, individuals and teams can experience in one click the ability to design and collaborate on 3D workflows without the need for any local computing power. Omniverse Cloud will leverage Nvidia’s cloud gaming solution, GeForce Now, which has a global graphics delivery network.

Read More: NVIDIA Announces Omniverse Avatar Cloud Engine, A Suite Of Cloud-Native AI Models And Services

“The next evolution of the internet called the metaverse will be extended with 3D,” said Richard Kerris, vice president of the Omniverse at Nvidia. “To understand what the impact of that will be, the traditional internet that we know today connects websites described in HTML and viewed through a browser. The metaverse will be the evolution of that internet connecting virtual 3D worlds using USD, or universal scene description.”

Omniverse Cloud is based on the open Universal Scene Description (USD) standard for interoperable 3D assets.

“The metaverse, the 3D internet, connects virtual 3D worlds described in USD and viewed through a simulation engine,” said Jensen Huang, Nvidia CEO, in a statement. “With Omniverse in the cloud, we can connect teams worldwide to design, build, and operate virtual worlds and digital twins.”

Advertisement

NeMo LLM Service: Nvidia’s cloud service to make AI less complicated

nvidia’s nemo llm service

Nvidia has announced NeMo LLM, its first cloud service to make AI less complicated. NeMo LLM will focus on making large language models more accessible for experimentation and deployment across multiple domains. 

Ian Buck, GM and VP of Accelerated Computing at Nvidia, said that many AI models need to be turned into more accessible applications so that enterprises can fit them in real-world settings. NeMo LLM  adds a layer of intelligence and interactivity to enable user interaction with complex AI models like DALL-E 2. Such language models are trained on billions of parameters, making model tuning a challenging task.

Nvidia’s NeMo LLM service will add a conversational element to many such models across domains like finance, medicine, or technology. Buck said, “This service will help bring large language models to all sorts of different use cases – to generate profit summaries, for product reviews, to build technical Q&A, for medical use cases.”

Read More: From SIGGRAPH to Jetson AGX Orin Production Modules: Latest Announcements by NVIDIA

NeMo LLM takes some pre-trained models like NeMo Megatron (trained on 530 billion parameters), GPT-3 (trained on 175 billion parameters), or T5 (trained on 11 billion parameters); and constructs a domain-based framework around it. This saves the need to train a model from scratch.

Nvidia is also launching the BioNeMo service along with NeMo LLM to provide researchers with access to pre-trained biology and chemistry language models. It is aimed to aid researchers in interacting with and manipulating protein and data for drug discovery. The initial two BioNeMo protein models, ESM-1 and ESM-2, cater to encoding essential biological information of large protein databases and predicting 3D protein structures from amino acid sequences, respectively. 

The NeMo LLM cloud service will be the recent addition to Nvidia’s stable software machines, like RIVA and Merlin.

Advertisement

UNESCO inaugurates 2022 State of the Education Report for India: Artificial Intelligence in Education

UNESCO state of the education report artificial intelligence

The United Nations Educational, Scientific and Cultural Organization, or UNESCO, has inaugurated the State of the Education Report for India: Artificial Intelligence in Education. This is the fourth edition of the annual flagship report of the New Delhi UNESCO office. 

Based on extensive research and study, the report provides insight into the state of artificial intelligence and its market in the country. It talks about AI as a subject and its application in the education sector. As per the report, the Indian AI market will reach a net worth of US $7.8 billion by 2025, showcasing a compound annual growth of 20.2 percent! 

The press release mentions 10 recommendations by UNESCO for promoting AI in education. These recommendations specify AI ethics as a priority, the need for a regulatory framework, effective public-private partnerships, expanding AI literacy, work on correcting algorithmic biases, and a few others. 

Read More: Diffusion Bee: a Mac app that creates AI images with text

Eric Falt, Director at UNESCO, New Delhi, said, “India has made significant strides in its education system, and strong indicators point to the country’s notable efforts to enhance learning outcomes, including by using Artificial Intelligence-powered education technology.” 

He also mentioned that artificial intelligence is one of the areas where the Indian government has advanced and made tremendous strides in the last few years.

Advertisement

Harvard and Stanford developed self-supervised AI to detect disease using NLP-based reports

harvard and stanford ai detect diseases using nlp reports

Stanford University and researchers at the Harvard Medical School have developed an artificial intelligence model that detects abnormalities and diseases by studying NLP-based reports. The AI model does not rely on standard human annotations of X-rays to learn to predict diseases. 

Using AI in medical imaging technologies is not a new advancement. However, many challenges still limit its application to only a handful of clinical applications. A massive amount of data and human annotations must go into training the standard disease prediction models. 

However, the model created by Harvard and Stanford, called CheXzero, has shown accurate results by relying on reports created by NLP rather than human annotations. The model is self-supervised, meaning that it can train itself to learn more. Self-supervised algorithms automatically address the issue of over-dependence on labeled data.

Read More: Diffusion Bee: a Mac app that creates AI images with text

Pranav Rajpurkar, assistant professor at HMS, said, “Up until now, most AI models have relied on manual annotation of huge amounts of data—to the tune of 100,000 images—to achieve a high performance. Our method needs no such disease-specific annotations.”

Researchers have used chest X-rays as an example to show CheXzero’s capabilities, but it can be generalized to a vast array of other medical setups that deal with unstructured data. The AI model helps bypass the requirement of large-scale labeling bottlenecks that have been a long-standing challenge in medical machine learning.

Advertisement

Adept ACT-1: an AI assistant that can browse, search and use web apps like humans

act-1 ai can browse like humans

Adept, an AI and ML product company, announced a large-scale transformer ACT-1, an AI assistant that can browse, search, and use the web like humans. When provided with instructions, the AI model behaves like a personal assistant in software and navigates the web, scrolls, likes, and types whenever required. The company has released a demo video of how ACT-1 works. 

ACT-1 has been developed to work with digital tools, and has recently learned to use a web browser. It connects to a chrome extension that allows it to observe users’ actions in the browser and performs activities like searching and scrolling. The action space includes UI elements on the page, and the observation is rendered across other websites universally. 

Read More: Meta and YouTube to expand policies, research to fight online extremism

ACT-1 can:

  • Process high-level user requests/queries with only a command text. In this case, getting a single task done necessitates conducting activities and noting observations frequently.
  • Work with spreadsheets and exhibit real-world knowledge in inferring context and assisting computations.
  • Combine multiple tools to finish a complex task. 

The large-scale transformer ACT-1 is still in its infancy and will become more useful in the future as it is continually seeking advanced training and enhancements. It is incredibly coachable and can fix errors with just one human feedback. However, there is a potential risk for ACT-1 being misappropriated with hateful input commands. 

Adept plans to work on preventing any possible misuse by utilizing machine learning techniques and carefully staging deployment.

Advertisement