[ English | 中文 ]

AthenaGen

Generate text with Ollama, then use a simple script to assemble the output into a pre-training dataset.

How to use

Start the Ollama API with ollama serve (refer to the Ollama documentation for details).

Run python Synth.py

Enter the name of the Ollama model you want to use.

Enter prompts manually, typing done to end manual entry, or place one prompt per line in the txt file.
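The steps above can be sketched roughly as follows. This is not the code in Synth.py; it is a minimal illustration that assumes Ollama is listening on its default address (http://localhost:11434) and uses its /api/generate endpoint. The helper names read_prompts, build_request, and generate are hypothetical.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def read_prompts(lines):
    """Collect one prompt per line, stopping at a line that is exactly 'done'."""
    prompts = []
    for line in lines:
        text = line.strip()
        if text == "done":
            break
        if text:  # ignore blank lines
            prompts.append(text)
    return prompts


def build_request(model, prompt):
    """Build a non-streaming request body for Ollama's /api/generate endpoint."""
    return {"model": model, "prompt": prompt, "stream": False}


def generate(model, prompt, url=OLLAMA_URL):
    """Send one prompt to a running Ollama server and return the generated text."""
    body = json.dumps(build_request(model, prompt)).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))["response"]
```

Because read_prompts accepts any iterable of lines, the same function works for an open text file (one prompt per line) and for lines typed at the keyboard. Calling generate requires that ollama serve is already running and that the chosen model has been pulled.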

alltxttojson.py

Set the data variable in the script to the directory that holds the txt files.

For example, if the txt files are stored in ./cleaned3:

data="./cleaned3"

Run python alltxttojson.py

The script writes the data{}.json file.
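For readers who want to see the idea behind the conversion, here is a minimal sketch. It is not the contents of alltxttojson.py: the {"text": ...} record layout, the function names, and the explicit output path are all assumptions made for illustration.

```python
import json
from pathlib import Path


def txt_dir_to_records(data):
    """Read every .txt file in `data` into a {"text": ...} record (assumed layout)."""
    records = []
    for path in sorted(Path(data).glob("*.txt")):
        content = path.read_text(encoding="utf-8").strip()
        if content:  # skip empty files
            records.append({"text": content})
    return records


def write_json(records, out_path):
    """Write the records as one JSON array, keeping non-ASCII text readable."""
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)
```

With data="./cleaned3", calling write_json(txt_dir_to_records(data), out_path) would gather every txt file in that directory into a single JSON file at whatever out_path you choose.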