# Integrated-Gradients

A PyTorch implementation of Integrated Gradients, the attribution method introduced in the paper [Axiomatic Attribution for Deep Networks](https://arxiv.org/abs/1703.01365) (Sundararajan et al., 2017).

The original implementation is in TensorFlow, and Captum's implementation is rather heavyweight; this repository provides a lightweight alternative.

The code here is kept deliberately simple, making it easy to understand the method and extend it to other downstream tasks.
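For reference, integrated gradients attributes the model output $F$ to the $i$-th input feature by integrating gradients along the straight-line path from a baseline $x'$ to the input $x$; in practice the integral is approximated with an $m$-step Riemann sum, as in the paper:

$$
\mathrm{IG}_i(x) = (x_i - x_i') \int_0^1 \frac{\partial F\left(x' + \alpha (x - x')\right)}{\partial x_i}\, d\alpha \;\approx\; (x_i - x_i') \cdot \frac{1}{m} \sum_{k=1}^{m} \frac{\partial F\left(x' + \frac{k}{m} (x - x')\right)}{\partial x_i}
$$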

## How to Use

Here is an example of sentiment classification in NLP (see `sentiment_classification.ipynb`). Note that this code must be run in a Jupyter notebook, since the visualization depends on IPython. The main code is in `integrated_gradients.py`.

```python
# for sentiment classification
from transformers import BertConfig, BertTokenizer, BertForSequenceClassification

from integrated_gradients import (get_integrated_gradients, get_scores,
                                  get_related, visualize)

pretrain_path = "fabriceyhc/bert-base-uncased-imdb"
bert_config = BertConfig.from_pretrained(pretrain_path)
bert_config.output_hidden_states = True
model = BertForSequenceClassification.from_pretrained(pretrain_path, config=bert_config).cuda()

tk = BertTokenizer.from_pretrained(pretrain_path)
input_text = ["I am happy because the weather is extremely good!",
              "This film is bad! I hate it!"]
inputs = tk(input_text, max_length=128, return_tensors="pt", truncation=True, padding=True)
inputs = {k: v.cuda() for k, v in inputs.items()}
grads, labels = get_integrated_gradients(model, **inputs)
scores = get_scores(grads)
tok_text = [tk.tokenize(t) for t in input_text]
# get the top-3 positively and negatively correlated tokens
positives, negatives, tok_text = get_related(tok_text, scores, 3)
# red marks positive correlation, green marks negative correlation;
# deeper colors indicate stronger correlation
visualize(tok_text, positives, negatives, labels)
```

The result is a color-coded visualization of the tokens (see the rendered image in the repository).
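Under the hood, integrated gradients for a BERT classifier is usually computed over the input embeddings rather than the discrete token ids. The sketch below illustrates that idea; it is a hypothetical re-implementation, not the code in `integrated_gradients.py`, and the helper name `ig_over_embeddings`, the zero-embedding baseline, and the step count are assumptions of this sketch:

```python
# Minimal sketch of integrated gradients over BERT input embeddings.
# NOTE: illustrative only -- not the repository's integrated_gradients.py.
# The zero-embedding baseline and the m-step Riemann sum are assumptions.
import torch

def ig_over_embeddings(model, input_ids, attention_mask, steps=20):
    emb = model.get_input_embeddings()(input_ids).detach()  # (B, T, H)
    baseline = torch.zeros_like(emb)                        # all-zero baseline
    # Fix the target class at the model's prediction on the real input.
    with torch.no_grad():
        target = model(inputs_embeds=emb,
                       attention_mask=attention_mask).logits.argmax(-1)
    total = torch.zeros_like(emb)
    for alpha in torch.linspace(1.0 / steps, 1.0, steps):
        # Interpolate between the baseline and the real embeddings.
        interp = baseline + alpha * (emb - baseline)
        interp.requires_grad_(True)
        logits = model(inputs_embeds=interp,
                       attention_mask=attention_mask).logits
        # Gradient of the target-class score w.r.t. the embeddings.
        score = logits.gather(1, target.unsqueeze(-1)).sum()
        total += torch.autograd.grad(score, interp)[0]
    avg_grad = total / steps
    # Sum attributions over the hidden dimension for one score per token.
    return ((emb - baseline) * avg_grad).sum(-1), target    # (B, T)
```

Ranking these per-token scores then yields the most positively and negatively contributing tokens, mirroring what `get_scores` and `get_related` appear to do in the snippet above.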

## How to cite? 🔗🔗🔗

```bibtex
@misc{IG_Tai,
  author = {Yunpengtai},
  title = {The implementation for Integrated Gradients},
  year = {2022},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/sherlcok314159/Integrated-Gradients}}
}
```