Azure powers intelligent services that have recently caught our attention, such as Microsoft Copilot, Bing, and Azure OpenAI Service.
Large language models (LLMs) are the secret sauce behind these services, enabling generative AI in a plethora of applications such as Microsoft Office 365, chatbots, and search engines.
How Microsoft uses LLMs to their fullest potential

But developing new LLMs, or improving the accuracy of existing ones, is a difficult task.
The most recent MLPerf 3.1 Training results demonstrate Azure's ongoing dedication to building high-caliber, high-performance cloud platforms that achieve unmatched efficiency in large-scale LLM training.
Azure trained the GPT-3 LLM model, with its 175 billion parameters, on 1,344 ND H100 v5 virtual machines (VMs), representing 10,752 NVIDIA H100 Tensor Core GPUs connected by the NVIDIA Quantum-2 InfiniBand networking infrastructure.
The workload stresses the H100 GPUs' Tensor Cores, the direct-attached Non-Volatile Memory Express (NVMe) disks, and the NVLink interconnect.
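The figures above fit together with simple arithmetic: each ND H100 v5 VM carries 8 H100 GPUs, so 1,344 VMs yield the 10,752 GPUs cited. A minimal sketch of that sizing calculation (the per-VM GPU count comes from the ND H100 v5 VM specification; the constant name is our own):

```python
# Cluster sizing arithmetic from the figures in this article.
# Each ND H100 v5 VM carries 8 NVIDIA H100 GPUs (per the Azure VM spec).
GPUS_PER_ND_H100_V5_VM = 8
vm_count = 1_344

total_gpus = vm_count * GPUS_PER_ND_H100_V5_VM
print(total_gpus)  # 10752, matching the GPU count cited above
```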
Azure has made remarkable progress in scaling up training, as evidenced by the largest submission in the history of MLPerf Training.
Compared to the NVIDIA bare-metal submission, this translates to just a 2 percent increase in training time in Azure, delivering best-in-class virtual machine performance among cloud HPC instance offerings.

For more details visit govindhtech.com
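To make the 2 percent figure concrete, a brief sketch of what that overhead means relative to a bare-metal baseline. The baseline duration below is hypothetical, chosen only for illustration; MLPerf publishes the actual wall-clock results:

```python
# What a "2 percent increase in training time" means versus bare metal.
bare_metal_minutes = 100.0  # hypothetical baseline, not from the article
azure_overhead = 0.02       # the ~2% figure cited above

azure_minutes = bare_metal_minutes * (1 + azure_overhead)
print(azure_minutes)  # 102.0
```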