readme
This commit is contained in:
16
README.md
16
README.md
@@ -14,10 +14,20 @@ poetry run python list.py
|
|||||||
poetry run python download.py <dataset name>
|
poetry run python download.py <dataset name>
|
||||||
```
|
```
|
||||||
|
|
||||||
## Notes
|
To download all datasets
|
||||||
|
```
|
||||||
|
poetry run python download.py all
|
||||||
|
```
|
||||||
|
|
||||||
ssgetpy does not discriminate "real" datatype from "integer".
|
## What it does
|
||||||
Therefore, we have to do some filtering on the returned results.
|
|
||||||
|
Downloads subsets of the suitesparse collection to different directories.
|
||||||
|
For a given subset, the matrices are first downloaded to the `$SCRATCH/suitesparse` folder. If a required matrix already exists there, it is not redownloaded.
|
||||||
|
Then, a relative symlink is created from the `$SCRATCH/<subset>/<matrix>.mtx` file to the corresponding `.mtx` file in `$SCRATCH/suitesparse`.
|
||||||
|
|
||||||
|
This makes use of a [fork of the `ssgetpy`](github.com/cwpearson/ssgetpy) package with a faster download limit.
|
||||||
|
ssgetpy does not discriminate "real" datatype from "integer" datatype, as shown on the suitesparse collection website.
|
||||||
|
Therefore, `lists.py` maintains a manually-curated list of `integer` datatype matrices to facilitate discrimination.
|
||||||
|
|
||||||
## how this was done
|
## how this was done
|
||||||
|
|
||||||
|
Reference in New Issue
Block a user