-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
960f2fb
commit 90fd848
Showing
15 changed files
with
121 additions
and
30 deletions.
There are no files selected for viewing
20 changes: 20 additions & 0 deletions
20
docs/writing/posts/Karpathy's - let's build GPT from scratch.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
--- | ||
draft: true | ||
date: 2024-03-19 | ||
slug: lets-build-gpt-from-scratch | ||
tags: | ||
- llm | ||
authors: | ||
- Prabha | ||
--- | ||
!!!note "Self Note" | ||
This note is for me to understand the concepts | ||
|
||
|
||
!!!note "Learning Resource" | ||
Karpathy's tutorial on Youtube [Lets build GPT from scratch](https://www.youtube.com/watch?v=kCc8FmEb1nY&t=2794s) | ||
|
||
|
||
ChatGPT is probabilistic system | ||
|
||
Transformer Neural Net is used for LLMs |
Binary file not shown.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,12 @@ | ||
--- | ||
draft: true | ||
date: 2024-03-12 | ||
slug: | ||
tags: | ||
- llm | ||
authors: | ||
- Prabha | ||
--- | ||
|
||
- .spm is associated with SentencePiece Model files. | ||
- SentencePiece is a library and tool for unsupervised text tokenization and detokenization |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,17 @@ | ||
--- | ||
draft: true | ||
date: 2024-03-06 | ||
slug: looker-download-tracking | ||
tags: | ||
- "#looker" | ||
authors: | ||
- Prabha | ||
--- | ||
# Question: Is there a way to track the user downloads in Looker? | ||
|
||
Response from Looker Support Below: | ||
So, there is no direct way to track download activities. This is currently a Feature Request - [https://portal.feedback.us.pendo.io/app/#/case/190687](https://portal.feedback.us.pendo.io/app/#/case/190687) | ||
|
||
However, we have a workaround using System Activity Event Attribute explore, please check the link - [https://madhive.cloud.looker.com/explore/system__activity/event_attribute?fields=event.created_time,user.name,event_attribute.event_id,event_attribute.name,event_attribute.value&f[event.category]=query&f[event.created_date]=30+days&f[event.name]=export%5E_query&sorts=event.created_time&limit=5000&query_timezone=America%2FNew_York&vis=%7B%7D&filter_config=%7B%22event.category%22%3A%5B%7B%22type%22%3A%22%3D%22%2C%22values%22%3A%5B%7B%22constant%22%3A%22query%22%7D%2C%7B%7D%5D%2C%22id%22%3A0%2C%22error%22%3Afalse%7D%5D%2C%22event.created_date%22%3A%5B%7B%22type%22%3A%22past%22%2C%22values%22%3A%5B%7B%22constant%22%3A%2230%22%2C%22unit%22%3A%22day%22%7D%2C%7B%7D%5D%2C%22id%22%3A2%2C%22error%22%3Afalse%7D%5D%2C%22event.name%22%3A%5B%7B%22type%22%3A%22%3D%22%2C%22values%22%3A%5B%7B%22constant%22%3A%22export_query%22%7D%2C%7B%7D%5D%2C%22id%22%3A4%2C%22error%22%3Afalse%7D%5D%7D&dynamic_fields=%5B%5D&origin=share-expanded](https://madhive.cloud.looker.com/explore/system__activity/event_attribute?fields=event.created_time,user.name,event_attribute.event_id,event_attribute.name,event_attribute.value&f[event.category]=query&f[event.created_date]=30+days&f[event.name]=export%5E_query&sorts=event.created_time&limit=5000&query_timezone=America%2FNew_York&vis=%7B%7D&filter_config=%7B%22event.category%22%3A%5B%7B%22type%22%3A%22%3D%22%2C%22values%22%3A%5B%7B%22constant%22%3A%22query%22%7D%2C%7B%7D%5D%2C%22id%22%3A0%2C%22error%22%3Afalse%7D%5D%2C%22event.created_date%22%3A%5B%7B%22type%22%3A%22past%22%2C%22values%22%3A%5B%7B%22constant%22%3A%2230%22%2C%22unit%22%3A%22day%22%7D%2C%7B%7D%5D%2C%22id%22%3A2%2C%22error%22%3Afalse%7D%5D%2C%22event.name%22%3A%5B%7B%22type%22%3A%22%3D%22%2C%22values%22%3A%5B%7B%22constant%22%3A%22export_query%22%7D%2C%7B%7D%5D%2C%22id%22%3A4%2C%22error%22%3Afalse%7D%5D%7D&dynamic_fields=%5B%5D&origin=share-expanded) | ||
|
||
You can also refer to- [https://www.googlecloudcommunity.com/gc/Technical-Tips-Tricks/Track-Downloads-in-System-Activity-workaround/ta-p/586869](https://www.googlecloudcommunity.com/gc/Technical-Tips-Tricks/Track-Downloads-in-System-Activity-workaround/ta-p/586869) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
|
||
|
||
|
||
|
||
| GPU | Cost | Speed (General Performance) | Memory | Energy Consumption | Training Suitability | Inference Suitability | Other Factors | | ||
| --------------- | ---- | -------------------------------------- | ------------- | ------------------ | -------------------- | --------------------- | -------------------------------------------------------------------------------- | | ||
| **NVIDIA T4** | $$ | Good for inference | 16 GB GDDR6 | Low (70W) | Limited | High | Efficient for edge computing and power-sensitive environments | | ||
| **NVIDIA V100** | $$$ | Excellent for training & inference | 16-32 GB HBM2 | Moderate (250W+) | High | High | Well-suited for AI model training, HPC | | ||
| **NVIDIA A100** | $$$$ | Superior for both training & inference | 40 GB HBM2e | High (400W) | Very High | Very High | Supports Multi-Instance GPU (MIG) for versatile workloads, Sparsity acceleration | |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,25 @@ | ||
--- | ||
draft: true | ||
date: 2024-03-06 | ||
slug: my-q_and_a | ||
tags: | ||
- llm | ||
authors: | ||
- Prabha | ||
--- | ||
!!!note "Self Note" | ||
This note is for me to understand the concepts | ||
# MY Q&A | ||
|
||
## Why is the fine-tuned LLM Model is faster? | ||
|
||
|
||
|
||
|
||
### Why DSPy compiled program score less than the uncompiled program? | ||
|
||
[Colab Notebook](https://colab.research.google.com/drive/1wwyCGgKizNZo48IzfKa9m4uMp2SNRBSF#scrollTo=IyjklZsKCxF-) | ||
|
||
- Found that chain of thought doesn't give the expected output (gives out blank output) so metric calculation is not meaningful. | ||
|
||
- So make sure DSPy lm model is producing expected output. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,13 @@ | ||
|
||
- LLM model is getting bigger, quantization helps reduce model size with little to no quality degradation | ||
- ![[Pasted image 20240415112817.png]] | ||
|
||
- Some state of art methods to reduce models are | ||
- Pruning | ||
- Removing layers which do not contribute to model decisions | ||
- Knowledge Distillation | ||
- Train a student model using the teacher model, | ||
- Challenge is you still need to fit original large model in your machine | ||
- Quantization | ||
- in nn you can quantize weights, activations | ||
- Idea is to represent model weights with lower precision (achieved by converting to different dtype ) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters