
Data Engineering Courses of 2022


Over the last few years, most enterprises have undergone a digital transformation and produced unimaginable volumes of data. This raw data alone is insufficient to push data science projects forward in production. As per Gartner, back in 2017, 85% of data projects failed because the data could not be trusted to facilitate business decisions. At the time, data scientists were expected to prepare the data themselves before actually using it in a project. However, it has become apparent that “someone” needs to organize and transform this data to ensure quality, usability, and availability so that data scientists do not spend much time before the actual work begins. Data engineers are the ones who get this job done. You can opt for a data engineering course to learn more about data engineering and get one of the most in-demand jobs in the big data world.

What do data engineers do?

A data engineer’s primary objective is to transform raw data into something valuable and understandable before presenting it to the enterprise. To do so, they design, construct, test, integrate, manage, and refine data using various tools and sources. The goal is to build data pipelines that operate efficiently. Data engineers also work closely with infrastructure teams to automate several steps in the data engineering process, and they write complex queries to make the data available.

Top Data Engineering Courses

Several data engineering courses are available, and selecting the right one is challenging. This article lists some informative courses for aspiring data engineers. Have a look.

  1. Professional Certificate in Data Engineering Fundamentals (IBM)

Professional Certificate in Data Engineering Fundamentals (IBM) is an excellent introductory course if you are interested in venturing into data engineering. Since data engineers are at the core of a data science project, creating the pipelines that guide the workflow, knowing the fundamentals is essential. This course provides a comprehensive theoretical and practical introduction to building pipelines, managing data, and the data engineering ecosystem and lifecycle.

The certification includes three sub-courses:

  • Data Engineering Basics
  • Python Basics for Data Science
  • Relational Databases and SQL.

The course spans 4 months and takes an average of 4-6 hours per week.

  2. Data Engineering with AWS Machine Learning (Pluralsight)

Storing data for complex machine learning projects is tedious because of varying data formats. This data engineering course focuses on how to store data and leverage machine learning on the AWS platform. In this course, Data Engineering with AWS Machine Learning by Pluralsight, you will learn how to select the appropriate AWS service for each data-related activity for any given scenario. Initially, you will investigate data storage options and the purposes of each type of storage. Finally, you will learn to transform raw data into usable formats.

The course will cover several topics that will introduce you to data engineering with AWS. 

  • Typical Data Flow for ML on AWS
  • Database Storage Options for ML on AWS
  • Data Warehouses and Data Lakes
  • Batch Data Ingestion
  • Data-driven Workflow

It is a short course that you can finish within 3 hours and will bring you one step closer to using AWS machine learning services with ease.

  3. Data Engineering Learning Path – Coursera

Data Engineering Learning Path is an excellent umbrella course offered by Coursera with which you can learn essential skills that a data engineer needs. Coursera suggests a combination of sub-courses that will aid you in moving towards a full-fledged data career. The following courses are recommended for a data engineering learning path:

  • Business Intelligence Analyst – Power BI, Tableau, SQL
  • Business Intelligence Developer – Software development, SQL, Javascript
  • Data Engineering – Python, Big Data, ETL

Coursera recommends a Coursera Plus subscription, which gives access to over 3,000 course options, to guide you through the multiple courses in a career learning path.

  4. Become a Data Engineer: Mastering the Concepts – LinkedIn Learning

If you are looking for a data engineer course online, LinkedIn Learning offers an extensive beginner-level course, Become a Data Engineer, for those who wish to learn the fundamentals of data engineering from scratch. You will study the core principles of data engineering, DevOps, tricks of the trade, and how to apply them on platforms for project work. The course discusses Big Data, SQL, and NoSQL coding for analysis. Moving forward, you will understand how Apache Spark works with Big Data technologies.

The course will cover

  • Data Science Foundations
  • NoSQL Essentials
  • Apache Spark Essential Training
  • Architecting Big Data Applications
  • Cloud SQL and SQL Essentials
  • Advanced NoSQL for Data Science and SQL Professionals

It will take approximately 13 hours to cover the entire material, and you will get a certificate on completion. 

  5. Data Engineering – ETL, Web Scraping, Big Data, SQL, Power BI (Udemy)

If you are looking for a big data engineer course, Data Engineering – ETL, Web Scraping, Big Data, SQL, Power BI is a beginner-level data engineering course that will teach you how to interact with data. It covers ETL, Web Scraping, SSIS, SQL, and Big Data.

The crash course is divided into twelve sections with 134 video lectures covering the following topics:

  • ETL, or Extract, Transform, and Load, a data pipeline pattern used to extract data from several sources, transform it according to requirements, and load it into a data store.
  • SQL Server Integration Services (SSIS) for data integration, transformation, and solving business problems.
  • Big Data, including numbers, audio, images, text, and other kinds of data with high volume, variety, and velocity.
  • SQL, a standard language for managing relational databases.
  • Power BI, a robust business analytics solution that helps with data visualization and business insights.

The course content is about twelve hours long and can be completed flexibly. On completion, you will be able to implement ETL with SSIS, scrape web data with Python, Beautiful Soup, and Scrapy, connect web data with Power BI, and model with Power BI.
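To make the extract, transform, and load steps concrete, here is a minimal sketch in plain Python. It is not taken from the course: the sample CSV, the `orders` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for a real data store.

```python
import csv
import io
import sqlite3

# Hypothetical raw input; in practice this would come from files, APIs, or databases.
RAW_CSV = """order_id,customer,amount
1,alice,19.99
2,bob,
3,carol,5.50
"""


def extract(text):
    """Extract: read raw CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing amount and normalize types and names."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append(
            (int(row["order_id"]), row["customer"].title(), float(row["amount"]))
        )
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the target table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders "
        "(order_id INTEGER PRIMARY KEY, customer TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", rows)
    conn.commit()


def run_pipeline(text):
    """Run extract -> transform -> load against an in-memory SQLite database."""
    conn = sqlite3.connect(":memory:")
    load(transform(extract(text)), conn)
    return conn
```

Calling `run_pipeline(RAW_CSV)` returns a connection whose `orders` table holds only the two complete records. Tools such as SSIS implement the same three stages with connectors, scheduling, and error handling on top.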


  6. Professional Certificate in Data Engineering (IBM)

After learning data engineering fundamentals, proceeding with another course, like Professional Certificate in Data Engineering by IBM, is a significant next step. This is one of the best data engineer courses in India, designed for people who want to advance their interest and knowledge in the field. It builds on the basics while teaching you application development, more complex pipelines, and data warehousing.

The course is divided into 14 sub-courses that will give you an insight into cloud-based relational databases (RDBMS) and NoSQL databases. Some of these are:

  • Python for Data Engineering
  • SQL for Data Engineers
  • Building ETL and Data Pipelines
  • Big Data Engineering, Hadoop, and Spark Basics
  • Data Engineering Capstone Project

The course spans over one year and two months, with an average of 3-4 hours per week. On completion, you will have acquired skills in Hadoop, Big Data, PostgreSQL, Bash, Data Warehousing, and other related technologies.  

  7. Microsoft Azure Data Engineering Associate DP-203 Exam Prep Specialization

It is not a standard course like other data engineering courses. However, opting for Microsoft Azure Data Engineering Associate Exam Prep Specialization will give you a different insight into data engineering. It is a rewarding path to being an associate with Microsoft, where you will learn about basic theoretical concepts and get hands-on experience with real-world scenarios.

The specialization program will cover the following sub-courses:

  • Data Engineering with Microsoft Azure
  • Data Storage and Integration
  • Data Warehousing and Engineering
  • Preparation for Data Engineering on Microsoft Azure Exam

It will take approximately thirteen months to complete, with an average of two hours per week. On completion, you will have learned about Azure Synapse Analytics, Apache Spark, Modern Data Warehousing, Azure Data Lake Storage, and other related technologies.

  8. AWS Solutions Architect Associate Certificate Prep

A data engineer must know at least one cloud service provider and its services. Amazon Web Services (AWS) is an industry leader in cloud computing. Data engineers who hold the AWS Certified Solutions Architect – Associate (SAA) certification have better career prospects and higher earnings. In this intermediate-level course, AWS Solutions Architect Associate Certificate Prep, you will get expert guidance on how and what to prepare for the examination.

The first week covers multi-tier data solutions and storage technologies. The second week covers flexible and scalable computing solutions and database networks. In week three, you will learn how to secure your data and database network. Lastly, the fourth week will teach you cost optimization for computing and database services.

The month-long course comes with flexible deadlines, sample certification questions, and skill-based hands-on exercises on data structures and architectures.

  9. Taming Big Data with Apache Spark and Python

This Big Data engineering course, Taming Big Data with Apache Spark and Python on Udemy, focuses on Big Data analysis using Apache Spark and Python. With more than 20 hands-on examples using large data sets, you will learn to use DataFrames, structured streaming with Spark 3, MLlib for ML-driven data mining, and other related concepts. The course is divided into eight sections, covering 66 video lectures. These sections are structured to cover the following concepts:

  • Introduction to Spark and RDD interface
  • SparkSQL, DataFrames, and DataSets
  • Spark Clusters and Spark ML
  • Spark Streaming and GraphX

The course will take approximately seven hours and requires access to a personal Windows/Linux computer and some prior scripting experience.

  10. Data Structures and Algorithms Nanodegree (Udacity)

In this data engineering course, Data Structures and Algorithms Nanodegree from Udacity, you will become acquainted with more than 100 data structures. Data engineers should know their way around multiple data structures and algorithms to be proficient in managing and sorting data. Knowing about data structures also helps them understand patterns in data and decide on appropriate operations. During the course, industry experts will deliver online lectures on Udacity’s platform and provide personalized project reviews. Once you finish the course, your project will undergo a strict review process before you are certified.

The certification will cover three sub-courses:

  • Data Structures
  • Basic Algorithms
  • Advanced Algorithms.

You need a basic knowledge of Python and algebra to enroll in the course, which runs over 4 months with an average of 10 hours per week.


Power BI Dashboard Examples


Businesses have shifted to the digital world and are entirely driven by the digital data they accumulate. This data is of significant value as it depicts how consumers interact with their products and services. However, you need to have a robust analytical tool to collect and analyze it. Microsoft’s Power BI, a powerful business analytics tool, provides a platform for data collection, analysis, and visualization through appealing dashboards and interactive reports, enabling companies to boost profitability and unearth deeper insights. Power BI dashboards are an essential visualization technique that offers a 360-degree perspective for speedy insight-gathering. 

Before proceeding with the examples, let’s look at what Power BI dashboards are.

What is a Power BI Dashboard?

A Power BI Dashboard is a canvas that showcases essential data points in multiple forms within a single page. The visuals on a dashboard, called tiles, are pinned from the underlying reports. Only the most significant parts of the data story are included in well-designed dashboards, and a specific element can be clicked to open the report it comes from. Dashboards are beneficial for tracking the progress of your company’s operations, sales, or other key metrics. They give you a bird’s-eye view of your company and aid in developing data-supported action plans. Additionally, there are numerous Power BI dashboard designs from which you can choose.

Top 10 Power BI Dashboards Examples

This article presents the Top 10 Power BI Dashboard Examples that will help you comprehend how Power BI dashboard samples can be utilized to demonstrate various scenarios and provide insights through thoughtfully planned and selected KPIs.

  1. E-Commerce Sales Power BI Dashboard

Online retailers can use this interactive Power BI template to evaluate the performance of various products from both a broad view and a granular one. It offers a summary of overall sales and has the option to display annual, quarterly, and monthly growth rates. Additionally, it enables retailers to explore and comprehend the best-performing goods, areas, and more.

This Power BI dashboard can filter sales by location, period, average order value, or other desired criteria. It can also be customized to display why certain products have been returned and to summarize the product delivery status.

Benefits of using an E-commerce Dashboard:

  • Get a bird’s eye view of business performance.
  • Saves time by allowing real-time information analysis.
  • Eliminates analysis paralysis by facilitating faster decision-making.

  2. Inventory Stock Analysis Power BI Dashboard Inspiration

Inventory management is essential to keep track of the supplies and materials you need to run a business effectively. Consequently, Inventory Power BI dashboard examples give you better access to your stock and enable you to undertake real-time inventory management.

This Power BI inventory dashboard sample is essential as it can be used by organizations of all sizes across multiple industries, including retail, FMCG, manufacturing, hospitality, education, and restaurants, to run efficiently and meet client demand. 

In addition to the stock, this dashboard can display customer reviews, most viewed, least viewed, and unviewed products. Based on the Fulfillment Cycle and MarkDown variance, you may anticipate stock availability and restocking cycles.

Benefits of using an Inventory Stock Analysis Dashboard:

  • Prevents stockouts and project delays.
  • Better cash flows as the dashboard highlights the business’s lean and rush periods.
  • Reduces inventory wastage.

  3. Price-Volume-Mix Variance (PVM) Analysis Dashboards

A standard business dashboard visualizes revenue, income, gross profits, etc. These metrics are used to compare ex-post and ex-ante business plans and forecasts using time references, geography, or product lines. However, a Price-Volume-Mix (PVM) variance analysis showcases how sales volume, product mix, and prices affect revenue. 

In this Power BI dashboard example, the factors that contributed most to each category’s revenue growth are highlighted, including price, volume & product mix fluctuation, and new product releases and discontinued goods. Such dashboards are handy for product managers and their teams to pinpoint critical problems and opportunities. 
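The arithmetic behind such a dashboard can be sketched in a few lines of Python. The decomposition below is one common convention, not necessarily the one used by this dashboard, and the product names and figures are hypothetical. It assumes the same products exist in both periods, so new and discontinued products would need separate handling.

```python
# Hypothetical data: product -> (unit_price, quantity) for two periods.
BASE = {"A": (10, 100), "B": (20, 50)}
CURRENT = {"A": (11, 90), "B": (20, 70)}


def pvm_variance(base, current):
    """Split the revenue change into price, volume, and mix effects.

    price  = sum((p1 - p0) * q1)            effect of changed unit prices
    volume = (Q1 - Q0) * base average price effect of selling more or fewer units
    mix    = sum(p0 * (q1 - q0 * Q1 / Q0))  effect of a shifted product mix
    """
    if base.keys() != current.keys():
        raise ValueError("this sketch expects the same products in both periods")

    total_q0 = sum(q for _, q in base.values())
    total_q1 = sum(q for _, q in current.values())
    if total_q0 == 0:
        raise ValueError("base period must have a non-zero total quantity")

    revenue0 = sum(p * q for p, q in base.values())
    revenue1 = sum(p * q for p, q in current.values())

    price = sum((current[k][0] - base[k][0]) * current[k][1] for k in base)
    volume = (total_q1 - total_q0) * revenue0 / total_q0
    mix = sum(
        base[k][0] * (current[k][1] - base[k][1] * total_q1 / total_q0)
        for k in base
    )
    return {
        "price": price,
        "volume": volume,
        "mix": mix,
        "total": revenue1 - revenue0,
    }
```

With this convention the three effects always add up to the total revenue change, which is what lets a dashboard show them as a waterfall from last period’s revenue to this period’s.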

Benefits of using a PVM Analysis Dashboard:

  • Helps to refine pricing policies to maximize profits.
  • Facilitates a more informed, data-driven understanding of the organization.
  • Pinpoints granular details due to its bottom-up approach.

  4. COVID-19 Power BI Dashboard Example

Undeniably, the COVID-19 pandemic has been the talk of the town for the past two years. A Power BI dashboard representing the distribution and impact of the virus is an excellent demonstration of Power BI’s capabilities. This dashboard can compare and contrast mortality and recovery rates across different countries.

This dashboard can also show the country-wise distribution of active and recovered cases. It can also track the death toll and visualize the pattern to forecast future states.

Benefits of using a COVID-19 Dashboard:

  • Helps in tracking active and recovered Covid patients.
  • Based on the trends, people can plan their work/travel.

  5. Sentiment Analysis Dashboards

Sentiment analysis is an artificial intelligence-driven technique that analyzes unstructured data to draw out opinions and emotions. These outcomes can be represented in eye-catching charts and graphs with business intelligence software like Microsoft’s Power BI to get clear insights. These dashboards are extremely useful because sentiment analysis datasets are otherwise largely confined to research, where they are used to develop rule-based models and advance artificial intelligence-driven techniques.

For instance, consider a social media business owner who posts about their products. This Sentiment Analysis Interactive Dashboard shows various analytical points in data accessible from online retailers. The dashboard also makes it possible to determine what customers think of a product and other details like customer satisfaction and discontent ratings for the same products. 

Benefits of using a Sentiment Analysis Dashboard

  • It can help you recognize the happiest customers.
  • These dashboards help in monitoring agent efficiency and performance.
  • Even website chatbots can benefit from a sentiment analysis dashboard to recognize customer moods.


  6. Social Media Monitoring & Analytics Dashboards

Nowadays, most businesses utilize social media platforms to market their products and services. Keeping track of your business’s performance on social media platforms becomes vital to see whether your efforts are reaping any benefits. Using Power BI dashboard templates for social media analytics, business owners can develop better marketing strategies, alter branding, and improve consumer engagement.

For instance, this dashboard displays monthly information about many social media subjects, such as web sources, the volume of online discussions, online influencers, unique categories, sentiment analysis, quotes, geolocation, and many other topics, making it one of the most common Power BI dashboard examples.

Such Social Media Dashboards can be extended to obtain tags and mentions, view good/nasty remarks, and locate influencers by geolocation. They also enable users to track their mentions in the sentiment analysis with hourly/daily filtering.

Benefits of using a Social Media Dashboard

  • Enhanced productivity and engagement with followers.
  • It can help you understand collaborations and whether they have been effective or not.
  • This dashboard can help you in scheduling your posts.

  7. Financial Analytics Dashboards

All organizations hold potentially valuable financial data. A Power BI Financial Analytics Dashboard is designed to study finances and infer trends for executive-level professionals. 

The dashboard gives a broad overview of the company’s financial performance throughout. Professionals can also explore financial performance according to the area and product category and identify financial patterns and regions of over/underperformance in these areas. These preliminary insights can then be used to choose where to concentrate their efforts.

For instance, this financial dashboard displays revenue, liabilities, expenses, gross profit, and other financial assets. Similarly, a financial dashboard can display the profit and loss statement, aggregate revenues, and balances.

Benefits of using a Financial Analytics Dashboard

  • It summarizes your financial performance.
  • It demarcates expenses accruing to different activities and their corresponding returns.
  • Using the insights, the organization’s financial growth can be tracked over time.

  8. Human Resource Analytics Dashboards

Human resource analytics is essential in employee management and reaching business goals. Given the available data, it is challenging to draw actionable insights quickly. 

To make this easier, the next entry on our list of Power BI dashboard examples provides several human resource analytics dashboard samples that HR professionals can use to analyze employee data and make better data-driven decisions.

It is a visual representation of key performance indicators (KPI) and HR data that provides an overview of the present situation and makes vital information easily accessible. For instance, this HR dashboard summarizes employee metrics by gender, age, absenteeism rate, contract types, terminations, and other related factors.

Benefits of using a Human Resource Analytics Dashboard:

  • These dashboards help identify effective measures and the ones that are not.
  • They help in tracking absentees and annual leaves taken by employees.
  • They also give insights into the hiring practices and trends in the organization.

  9. Global Oil Production and Consumption Dashboards

Big data visualization is a challenge in tracking crucial parameters, output figures, costs, etc., concerning the oil and gas industry. Power BI dashboards provide a revitalizing visual analytical solution for the entire industry.

This dashboard compiles all essential global oil production and consumption parameters, making it simpler to review massive amounts of oil production and consumption data quickly and in real time. The top 3 oil metrics for all countries (reserve, output, and consumption) are shown in this dashboard’s country tab.

The production tab in this dashboard also highlights the most extensive oil reserves, the oil production timeline, and the total production by each country. On the other hand, the consumption tab highlights the consumption metrics.

Benefits of using a Global Oil Production and Consumption Dashboard:

  • It can assist employees in making more informed decisions.
  • It helps in tracking the oil production of different countries. 
  • Oil is scarce, and such a Power BI dashboard helps track and manage oil reserves efficiently.

  10. Email Engagement Analytics Dashboard

These Power BI Dashboard designs are intended for businesses advertising their products or services via mass emails. These dashboards show the percentage of delivered, clicked, and opened emails. These dashboards typically use data from campaign management programs and show how these indicators changed during the relevant time frame.
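As a rough illustration of the numbers behind such a dashboard, the sketch below computes delivery, open, and click rates from raw campaign counts. The `CampaignStats` fields and the sample figures are hypothetical, and definitions vary between campaign tools: here the delivery rate is relative to emails sent, while open and click rates are relative to emails delivered.

```python
from dataclasses import dataclass


@dataclass
class CampaignStats:
    """Raw counts for one campaign period (hypothetical structure)."""
    sent: int
    delivered: int
    opened: int
    clicked: int


def engagement_rates(stats):
    """Return delivery, open, and click rates as percentages."""
    def pct(part, whole):
        # Guard against empty campaigns instead of dividing by zero.
        return round(100 * part / whole, 2) if whole else 0.0

    return {
        "delivery_rate": pct(stats.delivered, stats.sent),
        "open_rate": pct(stats.opened, stats.delivered),
        "click_rate": pct(stats.clicked, stats.delivered),
    }


# Month-wise comparison with made-up numbers.
monthly = {
    "Jan": CampaignStats(sent=1000, delivered=950, opened=380, clicked=95),
    "Feb": CampaignStats(sent=1200, delivered=1140, opened=513, clicked=171),
}
report = {month: engagement_rates(stats) for month, stats in monthly.items()}
```

The resulting `report` dictionary has one entry per month, which is the shape a month-wise comparison visual would typically consume.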

This is one of those Power BI dashboard examples that display several email metrics and a month-wise comparison. These boards can be customized to include other relevant metrics like email-driven sales by category, recurring customers, etc.

Benefits of using an Email Engagement Analysis Dashboard:

  • It saves time as it displays all vital metrics on a single dashboard.
  • Visual insights make it easier to draw conclusions and decide whether the emails are effective or not.
  • These dashboard insights can also be used to schedule emails for geolocations in different time zones.

Power BI Dashboards vs Reports

While these Power BI dashboards may seem like a report as they summarize performance or common metrics, they are fundamentally different. Here are a few differentiators:

  • A dashboard in Power BI may pin visuals after drawing insights from multiple datasets, but a report usually focuses on and caters to a single dataset at once.
  • Dashboards are a single-page summary, whereas reports may take as many pages as required.
  • Visualizations on a dashboard in Power BI focus on building insights using attractive elements such as graphs, whereas reports do not concentrate on visualizing.

Dashboards are a great method to keep an eye on your company and quickly see all your key indicators. These visuals may be drawn from a single underlying dataset or several and even from underlying Report(s). A dashboard does more than just visualize; it updates automatically when the underlying data changes, making it highly interactive.


Shriram Group to open a metaverse branch by 2023


The Shriram Group has announced that the company will have a metaverse branch by the first quarter of 2023. By doing so, it will become the first Indian non-banking financial company (NBFC) to be in a metaverse.

Novac Technology Solutions, Shriram’s digital arm working on virtual reality (VR), mixed reality (MR), and augmented reality (AR), will put the group on a metaverse that will have solutions for employees and customers. 

According to Pradeep B, associate vice-president of Novac, customers will be able to experience the solutions of the brand during the first phase. “We are also trying to incorporate a bot-based system, in which the customer will receive a call back based on their virtual interest,” he added.


In the second phase, the company will get a real-time support system, which will include avatar representations of brands/products. “Depending on regulatory approvals, the company will try to see what transactions can take place online through metaverse,” he said. 

Novac said that it has over 60 customers, and out of that, about 20 use metaverse solutions. It has a tie-up with a European VR and AI soft skills trainer, Bodyswaps, to offer teaching modules to the staff of its clients. 


Meta to shut down Super, a live-streaming platform for influencers


Meta has announced that it is going to shut down its Cameo-like app, Super, in February 2023. Developed by Meta in 2020, Super is a live-streaming platform for influencers.

According to TechCrunch, Meta wanted to create a virtual meet-and-greet experience similar to what users experience at real-life events like VidCon or Comic-Con through Super. 

Although Super is not officially shutting down until February, users will not be able to create new events during this period. If users have a pre-scheduled event on Super, the company advises that the event be rescheduled on another platform. 


Users who have participated in a Super event or have hosted one in the past can download their recorded media before the official decommissioning of its website in February. 

Super has joined a long list of apps and experiments that have been shut down by Meta this year. The company recently shut down its Facebook live shopping feature on October 1 to shift its focus to Reels.


Dataiku Raises $200 Million in Series F Funding

On 13th December, Dataiku, the well-known platform for Everyday AI, announced $200 million in Series F funding led by Wellington Management at a $3.7 billion valuation. The investment will help to strengthen Dataiku’s leadership position and expand its capabilities.

Matt Witheiler, Consumer/Technology Sector Lead at Wellington Management, stated that Dataiku’s proven track record, growth trajectory, management team, and customer roster help the company scale AI to new heights. Wellington Management is pleased to partner with and contribute to Dataiku’s journey.


Dataiku is a popular Everyday AI platform, allowing data experts and domain experts to work together to build AI products to carry out daily operations. It helps businesses to systemize the use of data for exceptional business results. About 500 businesses use Dataiku for predictive maintenance, supply chain, marketing optimization, quality control, and more.

Florian Douetteau, co-founder and CEO of Dataiku, mentioned that Dataiku is glad to attract new, market-leading investors like Wellington in today’s challenging market to strengthen its solution and its world-class team. The company has taken a leadership position in helping businesses use massive datasets at scale and create a culture of AI-focused business results.


Hexo Raises $270,000 in Pre-Seed Funding by Antler India

Generative AI startup Hexo has raised $270,000 in pre-seed funding led by Antler India through the venture capital firm’s Antler India Residency initiative.

Hexo is building an open-source image generation API that provides a wide range of controls to make image generation more accurate and predictable. Its fine-tuner product will allow businesses to quickly build image generation engines based on their design language, characters, products, and IP, and to embed text-to-image generation in their workflow.


Vignesh Baskaran, the co-founder of Hexo, mentioned that the pre-seed funding would be used to launch Hexo’s new product, a fine-tuner for businesses to build custom image generation engines. 

Antler India is thrilled to lead Hexo’s pre-seed round and collaborate with the company on its generative AI journey. The founder of Hexo, Vignesh, is a deep learning expert and has conducted many machine learning research projects globally. Kunal Bhatia, a three-time founder, has built SuperLearn (EdTech) and Switch (IoT). Antler India believes that Vignesh’s engineering strength and Kunal’s business skills can help the company build great AI products.


AWS, Meta and Microsoft to develop Google Maps rival Overture Maps


The Linux Foundation has announced plans to create a rival for Google Maps called Overture Maps. This is a new collaborative effort to develop interoperable open map data as a shared asset to strengthen mapping services worldwide.

It’s an open-source mapping effort that includes big companies like Amazon Web Services (AWS), Meta, Microsoft, and TomTom. The project is open to all communities with the goal of building open map data. 

The Linux Foundation announced the initiative through a press release about the project and a new website for the Overture Maps Foundation. 


The project will focus on integrating existing open map data from city planning departments and several projects like OpenStreetMap. It will also use new map data contributed by members and built using AI/ML techniques and computer vision to create a living digital record of the physical world.

The Overture Maps Foundation aims to power new map products via openly available datasets that may be used and reused across businesses and applications, with each member throwing their own data and resources into the mix. 


PubMed GPT: A GPT Model Trained on PubMed Biomedical Papers at Stanford


Researchers at the Stanford Center for Research on Foundation Models (CRFM) have recently worked on investigating industry-specific LLMs (large language models). They introduce PubMed GPT as a part of their research, specifically focusing on biomedicine. 

Using the MosaicML cloud platform, CRFM researchers trained a GPT model on PubMed biomedical papers. The resultant model is highly accurate in several NLP tasks. PubMed GPT is based on a HuggingFace GPT model and uses a biomedical tokenizer trained on the PubMed Abstracts and PubMed Central sections of the Pile dataset. It uses the PyTorch framework and MosaicML’s Composer library for training LLMs.

After training the model, researchers evaluated it on several popular benchmarks, a critical measure being the MedQA-USMLE question-answer challenge. In addition, the researchers manually assessed its generations for a task that involved summarising questions. They compared the results against several earlier general-purpose and biomedical models, including GPT-Neo, Galactica, and PubMedBERT.


The researchers concluded that LLMs are versatile and have much to offer when trained on domain-specific datasets. But the versatility comes at a cost due to many parameters. Model complexity, cost, specialized architectures, and domain knowledge are all trade-offs with the performance of PubMedGPT.

The researchers plan to concentrate future work on enhancing the scope of the model and assessing it against a more extensive collection of NLP tasks. PubMed GPT is intended solely for research as it is yet to be developed for production.


Meta takes down 40 phishing accounts by CyberRoot Risk Advisory


Meta has taken down more than 40 accounts operated by Indian firm CyberRoot Risk Advisory for phishing. The accounts were allegedly involved in hacking-for-hire services. 

The tech giant has also taken down a network of over 900 fake accounts on Facebook and Instagram operated by an unknown entity from China. 

These accounts were designed to collect data from people in the US, China, Myanmar, India, and Taiwan. According to Meta’s Threat Report on the Surveillance-for-Hire Industry, released on December 15, the accounts focused on military personnel, pro-democracy activists, government employees, politicians, and journalists.


According to the report, CyberRoot used fake accounts to create fictitious personas tailored to gain the trust of the people they targeted worldwide. These accounts impersonated journalists, business executives, and media personalities to appear more credible.

In some cases, CyberRoot also created accounts identical to those connected to their targets, like their family members or friends, with only slightly changed usernames, to trick people into engaging, the report said.


Donald Trump NFTs Collection Sells Out Within a Day


Former US President Donald Trump’s collection of non-fungible tokens (NFTs) sold out within a day of its launch, following a hyped announcement on Truth Social. The announcement featured an animated image of Trump in a superhero costume, shooting beams of laser light from his eyes. The announcement was not what people had expected, and it took many by surprise.

As per OpenSea, the collection’s trading volume is roughly 900 ETH (US$1.08 million), and its floor price is US$230, more than double the original price of US$99. Some select NFTs are being sold for even more: the rarest kinds (roughly 1,000 NFTs) are selling for as much as 6 ETH.


One of these extremely rare trading cards, depicting the 45th president carrying a torch while standing in front of the Statue of Liberty, is listed for 20 ETH, or almost US$24,000. Many one-of-ones are currently held in a Gnosis Safe multi-signature wallet, a secured wallet for receiving royalty payments from secondary NFT sales.


As per Dune Analytics, over 115 people purchased the set of 45 digital trading cards required for a guaranteed dinner with Trump, and over 17 people bought the maximum quantity permitted by the Trump Card site. However, other metrics hint that some wallets held far more by the time the collection sold out.
