Top Open-Source Large Language Models


Produced By: Sahil Pawar

Disclaimer

By open-source, we mean that the original code is available to the public to view and modify, via GitHub or other means.

LaMDA - Google

Google created LaMDA (Language Model for Dialogue Applications), a conversational LLM that serves as the core technology for dialogue-based applications capable of producing natural, human-sounding language.

BERT - Google

Researchers at Google AI developed the well-known language model BERT (Bidirectional Encoder Representations from Transformers) in 2018, and it has had a considerable impact on the NLP field.

LLaMA 2 - Meta AI

LLaMA (Large Language Model Meta AI) is a large language model that Meta AI announced in February 2023. Its latest version, LLaMA 2, was released on July 19, 2023.

Orca - Microsoft

Orca, developed by Microsoft, has 13 billion parameters. It aims to improve on the advancements of other open-source models by imitating the reasoning processes of larger LLMs.

Bloom - BigScience

BLOOM (BigScience Large Open-science Open-access Multilingual Language Model) is a transformer-based large language model developed by the BigScience project.

PaLM 2 - Google

PaLM 2 was pre-trained on parallel multilingual text and on a much larger corpus of different languages than its predecessor, PaLM. This makes PaLM 2 excel at multilingual tasks.

StableLM - Stability AI

StableLM is a series of open-source language models developed by Stability AI, the company behind the image generator Stable Diffusion. It was trained on a dataset built on 'The Pile'.

Dolly 2.0 - Databricks

Dolly 2.0 is an instruction-following LLM trained on the Databricks machine-learning platform. Based on Pythia-12b, it was fine-tuned on roughly 15k instruction/response records.

Cerebras-GPT - Cerebras

The Cerebras-GPT family was released to facilitate research into LLM scaling laws using open architectures and datasets, and to demonstrate the scalability of training LLMs on the Cerebras software and hardware stack.

Galactica - Meta

Galactica was Meta's LLM designed specifically for scientists and was trained on a large collection of academic material. Meta took it down just three days after its public release in November 2022.

XLNet - Google

XLNet is a language model released in 2019 by Google AI researchers. It overcomes drawbacks of conventional pre-training approaches, such as purely left-to-right, auto-regressive language modeling.


Designed by: Prathamesh