RowMancer
is a Command Line Interface (CLI) tool that allows you to count rows, columns, and files in CSV/TSV datasets. The tool provides various options for specific count metrics, including the ability to count blank files, specify directory depth, and calculate column statistics.
This project is under the Apache 2.0 License. See the LICENSE file for more details.
-
Clone the repository
git clone https://github.com/TheLustriVA/Rowmancer.git
-
Navigate to the project directory
cd Rowmancer
-
Install the package
pip install .
You can also install the package from PyPI:
pip install RowMancer
Run the tool with no options to count all rows in all .csv
and .tsv
files in the current directory and its subdirectories:
Rowmancer
-
Count Files:
-c, --count-files
- Count the number of
.csv
and.tsv
files instead of rows.
RowMancer --count-files
- Count the number of
-
Blank Files:
-b, --blank
- Count the number of blank or non-parsable
.csv
and.tsv
files.
RowMancer --blank
- Count the number of blank or non-parsable
-
Readable Numbers:
-l, --readable
- Show numbers in a more readable format (e.g., 1,000 instead of 1000).
RowMancer --readable
-
Directory:
dir
- Specify the directory to start the search.
RowMancer /path/to/directory
-
Header Row:
-H, --header-row
- Exclude the first row from each
.csv
file in the count.
RowMancer --header-row
- Exclude the first row from each
-
Depth:
-d, --depth
- Set the directory depth for the search.
RowMancer --depth 2
-
Column Stats:
-x, --columns
- Show column statistics (MIN, MAX, MEAN, SINGLE).
RowMancer --columns MIN
Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests.
- KGB aka Marco Lustri - With help from GPT-4
- Morgan Medici, who knows more than most have forgotten.