Subrata Goswami – Medium

Revisiting AI Scaling Law

So far, advances in AI have been made primarily by increasing the size of the model. From GPT-2 to GPT-4, the number of parameter…

Feb 23

Under the Hood of Llama 3.1 70B Distributed Inference

The following are some notes on how the Llama 3.1 70B model works in a distributed environment. Will focus only on the pure PyTorch…

Sep 3, 2024

Pre-training Mini Versions of LLMs — GPT and Llama3

This blog goes over how to pre-train small versions of the leading open-source Large Language Models (LLMs). Here, 3 models are covered: 2…

Jun 17, 2024

Some Core Principles of Large Language Model (LLM) Tuning

Large Language Models (LLMs) such as ChatGPT, Llama 2, etc. have taken the world by storm this year. What seems like a recent phenomenon…

Dec 31, 2023

Neural Network and AI Bottlenecks

This is a relatively high-level and simplified view of where the limitations are in the modern Deep Learning (DL) based AI stack. Compute can…

Jul 10, 2023

Diffusion Models for Generative AI

Recently, diffusion-based generative networks such as Stable Diffusion, DALL-E 2, Imagen, etc. have garnered a lot of publicity. In this…

Feb 27, 2023

TensorFlow — Graph, GraphDef, Grappler, XLA, MLIR, LLVM, etc

TensorFlow is a large and evolving code base with a mix of mostly C++ and Python code. It has grown immensely since its first public…

Jul 23, 2022

Deep Learning Multiview Stereo (MVS)

The goal of Multiview Stereo (MVS) is to generate a 3D point cloud or model from pictures taken from different locations. It is a problem…

Jan 18, 2022

ABCD Analysis of Human Eye

In the following passages, I try to capture my journey to understand some of the basic optical properties of eyes. I started with some well…

Nov 7, 2021

CVPR 2021 Select Paper Reviews

I reviewed a few papers that were presented orally at CVPR 2021 and tried to capture their essence in the following.

Jul 28, 2021
