Under the Hood of Llama 3.1 70B Distributed Inference
The following are some notes on how the Llama 3.1 70B model works in a distributed environment. Will focus only on the pure PyTorch…
Sep 3
Pre-training Mini Versions of LLMs — GPT and Llama3
This blog goes over how to pre-train small versions of the leading open source Large Language Models (LLM). Here 3 models are covered — 2…
Jun 17
Some Core Principles of Large Language Model (LLM) Tuning
Large Language Models (LLMs) such as Chat-GPT, Llama2, etc. have taken the world by storm this year. What seems like a recent phenomenon…
Dec 31, 2023
Neural Network and AI Bottlenecks
This is a relatively high-level and simplified view of where the limitations are in the modern Deep Learning (DL) based AI stack. Compute can…
Jul 10, 2023
Diffusion Models for Generative AI
Recently, diffusion-based generative networks such as Stable Diffusion, DALL-E2, Imagen, etc. have garnered a lot of publicity. In this…
Feb 27, 2023
TensorFlow — Graph, GraphDef, Grappler, XLA, MLIR, LLVM, etc
TensorFlow is a large and evolving code base with a mix of mostly C++ and Python code. It has grown immensely since its first public…
Jul 23, 2022
Deep Learning Multiview Stereo (MVS)
The goal of Multiview Stereo (MVS) is to generate a 3D point cloud or model from pictures taken from different locations. It is a problem…
Jan 18, 2022
ABCD Analysis of Human Eye
In the following passages, I try to capture my journey to understand some of the basic optical properties of eyes. Started with some well…
Nov 7, 2021
CVPR 2021 Select Paper Reviews
Reviewed a few papers that were presented orally at CVPR 2021. Tried to capture their essence in the following.
Jul 28, 2021
Review of Papers on Unsupervised Learning Presented in NIPS 2020
Looked at 3 papers (among many more) on unsupervised learning that were presented orally at NIPS 2020. Tried to capture their essence in…
Feb 17, 2021