Pre-Training the BERT 1.5B model with DeepSpeed
In this post, we show you how to run Habana’s DeepSpeed enabled BERT1.5B model from our Model-References repository.
Read moreIn this post, we show you how to run Habana’s DeepSpeed enabled BERT1.5B model from our Model-References repository.
Read moreIn this post, we will learn how to run PyTorch stable diffusion inference on Habana Gaudi processor, expressly designed for the purpose of efficiently accelerating AI Deep Learning models.
Read moreIn training workloads, there may occur some scenarios in which graph re-compilations occur. This can create system latency and slow down the overall training process with multiple iterations of graph compilation. This blog focuses on detecting these graph re-compilations.
Read moreIn this tutorial we will learn how to write code that automatically detects what type of AI accelerator is installed on the machine (Gaudi, GPU or CPU), and make the needed changes to run the code smoothly.
Read moreIn this tutorial, we will demonstrate fine tuning a GPT2 model on Habana Gaudi AI processors using Hugging Face optimum-habana library with DeepSpeed.
Read moreOne of the key challenges in Large Language Model (LLM) training is reducing the memory requirements needed for training without sacrificing compute/communication efficiency and model accuracy.
Read moreIn this post, we will learn how to migrate a TensorFlow EfficientNet model from running initially on a CPU to a Habana Gaudi Training processor, expressly designed for the purpose of efficiently […]
Read moreThe Habana team is happy to announce the release of SynapseAI® version 1.6.0. In this release, we introduce preliminary inference capabilities on Gaudi. For further details, refer to the Inference on Gaudi guide. […]
Read moreSynapseAI 1.5 brings many improvements, both in usability and in Habana ecosystem support. For PyTorch we removed the need for weight permutation, as well as the need to explicitly call load_habana_module. See […]
Read moreThe Habana team is excited to have launched our next-gen 7nm training and inference processors, Gaudi2 for training and Greco for inference, this morning on stage at Intel Vision 22. Sandra Rivera, […]
Read moreThe Habana(R) Labs team is pleased to augment software support for the Gaudi platform with the release of SynapseAI® version 1.4.0. In this release, we have made several version updates. We now support […]
Read more