Articula: AI-generated Content Detection and Understanding with Low-resource Training and Augmentations Resources
This repository presents implementation for the tool Articula: AI-generated Content Detection and Understanding with Low-resource Training and Augmentations. The Articula tool analyzes both input images and texts to detect whether they are generated by AI models.
- For setting up the project, use the
source project-setup.sh
command, and it will set up a virtual environment with prerequisite packages.
- Extract the zip folders.
- Generate the log probability files required for the training data. These scripts generate token level log probabilities which are used in vector combinations to create features. The models used to get log probabilities are GPT2 and DistilBert.
python common/ghostbusters_generate.py --logprobs
python common/ghostbusters_generate.py --logprob_other
- Generate the symbolic data which will be used to perform feature selection. The symbolic data file consists of values of all possible features based on vector and scaler combination of log probabilities. This will be used for feature selection instead of calculating it at every step. Possible Tuning options: generate_symbolic_data_four (increases the depth of feature combinations)
python trainer/ghostbuster_feature_selection.py --generate_symbolic_data
- Perform Feature Selection. Based on the generated symbolic data this script selects important features from all possible features which are important to detect if a text is written by AI or not. The list of features will be saved and used further to train a classifier.
Possible Tuning options:
- perform_feature_selection_four (feature selection based on the increased depth)
- perform_feature_selection_large (feature selection based only on large text datasets)
- perform_feature_selection_lang (seperate feature selection for each language)
- perform_feature_selection_domain (seperate feature selection for different domains of training data)
python trainer/ghostbuster_feature_selection.py --perform_feature_selection
- Train the classifier (Logistic, XGBoost or Random Forest). This script uses our training data and the best selected features to train a classifier for detecting AI generated text. There are 3 different scenarios:
- Ghostbusters: Trains a classifier only using the best features found from the above steps
- Intrinsic Dimensionality MLE: Trains a classifier using MLE Value from Intrinsic Dimensionality model (models/intrinsic_dim_estimator.py).
- Combined: Trains a classifier using both, the features from ghostbusters model and the MLE value from Intrinsic Dimensionality model.
python trainer/ghostbuster-train.py --model_name "logistic" --combined
- Use the app/text_app.py to test and spoof
python app/text_app.py --input "your text"
The experimentation utilizes the bensvmark datasets employed by previous state-of-the-art (sota) models. In this project, a custom model was trained on a subset of images extracted from all these models, creating a diverse dataset.
We adopted an ensemble setup for our experimentation. The models can be trained using the following command:
cd trainer
python img-train.py --name test --dataroot [your data path] --detect_method [CNNSpot, FreDect, Fusing, Gram] --blur_prob 0.1 --blur_sig 0.0,3.0 --jpg_prob 0.1 --jpg_method cv2,pil --jpg_qual 30,100
Alternatively, already trained models are available at this drive
The ensemble setup utilizes CNNSpot, FreDect, Fusing, and Gram models. Once the pretrained models are ready and placed under the weights folder, run the following to train the ensemble meta_model:
cd trainer
python ensemble-train.py
Testing the model can be done using the following script:
cd evaluator
python ensemble_prediction_pipeline.py
Note: For running all these scripts and understanding the arguments for training/testing the model, refer to the file common/img_options.py and the config file evaluator/configs/img_eval_config.py.
Multiple strategies are experimented with, such as Gaussian attack, salt and pepper noise attack, and frequency edits. These can be experimented with using the same ensemble_prediction_pipeline.py file by changing the options. For example:
python ensemble_prediction_pipeline.py --gaussian_noise --gaus_intensity 5
python ensemble_prediction_pipeline.py --salt_pepper_noise --s_p_density 0.1
python ensemble_prediction_pipeline.py --freq --freq_prob 0.1
same can be run using the following script
cd evaluator/eval_different_attacks.sh
./eval_different_attacks.sh
=======
-
Create a specific branch named in the format like
feature/intrinsic-dimension-model
, highlighting the name of the feature model in progress and make a pull request. -
Always
rebase
the your working branch before making a PR and add--force-with-lease
based branch push. Also, we highlight the steps needed for making a PR below assuming that you are working on your feature branch and you can opt to remove conflicts via your IDE while doing therebase
for your branch.
git checkout main
git pull origin
git checkout feature/intrinsic-dimension-model
git rebase main
git push origin feature/intrinsic-dimension-model --force-with-lease
- Finally, after reviewing the pull request, please
squash
all the commits during the merge into the main branch. Also, add in meaningful PR heading and joint-commit message for all your intermediate commits.
Note: Please, do not push directly onto the main branch, follow the proposed conventions, and also add another person as reviewer of your code.