The library implement codec for .bvecs, .fvecs and .ivecs vector file formats developed by INRIA LEAR and TEXMEX groups.
Vector file formats (.bvecs, .fvecs and .ivecs) is a serialized sequence of vectors []byte
, []float32
or []int32
.
Each vector is
32bit ⟨𝒅⟩ * n-bits
|--------------|-------/ ... /-------|
⟨𝒅⟩ dimension
- .fvecs stores 32-bit float (n = 32bit)
- .ivecs stores 32-bit signed or unsigned integers (n = 32bits)
- .bvecs stores bytes (n = 8bit)
The latest version of the library is available at main
branch of this repository. All development, including new features and bug fixes, take place on the main
branch using forking and pull requests as described in contribution guidelines. The stable version is available via Golang modules.
import "github.com/kshard/fvecs"
//
// Reading vectors
r, err := os.Open("siftsmall_base.fvecs")
d := fvecs.NewDecoder[float32](r)
for {
v, err := d.Read()
if err != nil {
break
}
}
//
// Writing vectors
w, err := os.Create("siftsmall_base.fvecs")
e := fvecs.NewEncoder[float32](w)
for _, v := range /* source of []float32 vectors */ {
err := e.Write(v)
if err != nil {
break
}
}
The library is MIT licensed and accepts contributions via GitHub pull requests:
- Fork it
- Create your feature branch (
git checkout -b my-new-feature
) - Commit your changes (
git commit -am 'Added some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create new Pull Request
The build and testing process requires Go version 1.13 or later.
build and test library.
git clone https://github.com/kshard/fvecs
cd fvecs
go test
The commit message helps us to write a good release note, speed-up review process. The message should address two question what changed and why. The project follows the template defined by chapter Contributing to a Project of Git book.
If you experience any issues with the library, please let us know via GitHub issues. We appreciate detailed and accurate reports that help us to identity and replicate the issue.