Meet OpenHathi, the first LLM Chatbot Superior in Hindi

www.analyticsdrift.com Image source: SaravamAI

[{"selector":"#anim-2940e6b8-bdf1-4433-ad7c-014d22206cd7","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-a17d7f30-cb05-4def-92ca-33403cdf4800","keyframes":{"transform":["translate3d(0px, 190.37697%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-83d0d825-17f6-4751-a5ec-31dca40da8a5","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Indian AI startup, Sarvam AI, has introduced OpenHathi-Hi-vo.1, representing the inaugural release within the OpenHathi series of large language models. Image source: Sarvam.ai

[{"selector":"#anim-620a0743-b92a-48b3-8cf1-0d12f07690fe","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-b5934a1c-a5b0-4c00-bc22-e5d4c9b48edf","keyframes":{"transform":["translate3d(0px, 167.32999%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-230e1990-bc3b-4b16-ae78-c2e5309c301b","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] The model expands upon the powerful Llama2-7B and boats performance similar to GPT-3.5 (sometimes even surpassing), specifically tailored for Indic languages. Image source: Llama

[{"selector":"#anim-8feaf46e-33a4-4644-b6e8-dc2146b72da0","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-cbc55b64-6e5d-46de-b1c7-95ca317bcf81","keyframes":{"transform":["translate3d(0px, 192.16266%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-c5ed5f7f-a1e0-4111-b337-3014bada3350","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] OpenHathi notably expanded the Llama2-7B tokenizer by adding 48,000 more tokens. This is possible as a result of a meticulous two-phase training process. Image source: Sarvam.ai

[{"selector":"#anim-4ad88a6f-5f8e-4a45-a345-c87ced2cdfa9","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-fbe9c9a9-2d82-4cc0-a446-9d5bd7b2b577","keyframes":{"transform":["translate3d(0px, 205.86774%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-86ce921b-45f2-412b-a215-5dac0c73b778","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Initially, the focus lies on embedding alignment, a method that strategically aligns the initial random Hindi embeddings. Image source: Canva

[{"selector":"#anim-eab8d582-fd68-47a1-9016-022d358ee580","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-c6fafca5-f58f-4470-b03a-25ccb9cf7c2e","keyframes":{"transform":["translate3d(0px, 180.55552%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-cd29e9c6-5c71-426c-8068-ccaafcbc07c6","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Following this is the bilingual language modeling phase, which educates the model on how to handle different languages attentively across tokens. Image source: Canva

[{"selector":"#anim-0d2417a3-0ae5-4bc9-bcec-324120383d47","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-9376a769-69de-4453-a3b4-33c60f86d0be","keyframes":{"transform":["translate3d(0px, 184.12695%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7be1374b-b18c-4b15-b9b1-6c6627788d9b","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Sarvam AI’s rigorous assessments cover not just standard Natural Language Generation tasks but also practical, real-world challenges. Image source: Canva

[{"selector":"#anim-b2b2e9b0-c206-47c8-a00e-027c7e843375","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-2a0c28e6-da92-4ff3-9a2a-1f7d38ded670","keyframes":{"transform":["translate3d(0px, 180.55552%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-1ba43931-2aaf-4a39-9aec-3cfd81ac1676","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] These evaluations, comparing OpenHathi against GPT-3.5 with GPT-4 as the referee, consistently highlight OpenHathi’s superior performance in Hindi. Image source: Sarvam.ai

[{"selector":"#anim-c8caa2c0-b513-412b-8f4f-1bc5792453e1","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7661f22f-96d4-414c-b358-cf2943149169","keyframes":{"transform":["translate3d(0px, 180.55552%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-22b67f32-d311-4950-8d06-556e5f84c11d","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] This collaboration saw Sarvam AI teaming up with academic partners from AI4Bharat, bringing in crucial language resources and benchmarking knowledge. Image source: Canva

[{"selector":"#anim-9301ad64-a492-4f31-bb69-635a1a5d92c5","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-70c13b43-958d-42e7-8ddf-bc1e170bb446","keyframes":{"transform":["translate3d(0px, 182.34126%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-4e09a9d0-2f00-4d66-8177-664bf9788e0f","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Moreover, the model’s refinement was a result of collaboration with KissanAI, utilizing conversational data derived from a bot engaging with farmers in diverse languages. Image source: Canva

[{"selector":"#anim-98730395-01c9-4771-87a1-f26b32d31b20","keyframes":{"opacity":[0,1]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-1c037838-0521-4144-890a-922e52d1e6c4","keyframes":{"transform":["translate3d(0px, 182.34126%, 0)","translate3d(0px, 0px, 0)"]},"delay":500,"duration":1500,"easing":"cubic-bezier(0.2, 0.6, 0.0, 1)","fill":"both"}] [{"selector":"#anim-4f2e147d-85cc-467f-a43d-f13fc50c97cb","keyframes":{"opacity":[0,1]},"delay":600,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] Pratyush Kumar and Vivek Raghavan, the founders of Sarvam AI, initiated this venture in July 2023. They received $41 million in Series A funding. Image source: Linkedin Read more

Get the latest updates on AI developments

[{"selector":"#anim-d04f9b25-2a1f-4469-91fb-9ca2162405f3","keyframes":{"opacity":[0,1]},"delay":200,"duration":1500,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-38a9a2c7-7a74-4637-97f5-2fb9b093d3f7","keyframes":{"transform":["translate3d(-103.35917%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-e011d1c6-14a1-4c1f-97b3-adc4d955dc47","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-e46d2d02-f0bf-4140-bced-fb8475f4f4f4","keyframes":{"transform":["scale(0.15)","scale(1)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"forwards"}] [{"selector":"#anim-7d90e7e7-9ded-4018-8bce-db13706dc528","keyframes":{"transform":["translate3d(134.00810%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7cca5c09-7d8b-4124-9ffd-5caaa0f117db","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-7772fe4a-bc53-435d-96af-3f01ac6f502a","keyframes":{"transform":["scale(0.15)","scale(1)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"forwards"}] [{"selector":"#anim-8175f603-a00e-4a2c-b5d5-240e93790403","keyframes":{"transform":["translate3d(129.34363%, 0px, 0)","translate3d(0px, 0px, 0)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-368de8e9-07a7-443c-959f-cbac60feb229","keyframes":{"opacity":[0,1]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"both"}] [{"selector":"#anim-fa254fc2-ab9c-4c03-864c-eaa26a6825b4","keyframes":{"transform":["scale(0.15)","scale(1)"]},"delay":0,"duration":600,"easing":"cubic-bezier(0.4, 0.4, 0.0, 1)","fill":"forwards"}] Produced by: Boudhayan Ghosh Designed by: Prathamesh Join Now

Meet OpenHathi, the first LLM Chatbot Superior in Hindi

Get the latest updates on AI developments

WhatsApp

Join our

Channel Now!