We had a kettle; we let it leak:
Our not repairing made it worse.
We haven`t had any tea for a week...
The bottom is out of the Universe.
― Rudyard Kipling
Not so much a project as a mixed bag of tools, queries, data and code snippets for use with Open Repair Data (ORDS).
For anyone who fancies tinkering with Python, NLP, SQL and lots of messy but interesting Open Data. Many challenges lie ahead!
Some of the things you can do:
There is also an installer and ORDS configuration for an Apache Solr search platform that demonstrates indexing and querying multi-lingual text with international (ICU) characters.
The dataset contains over 200k records representing over 15 years worth of electronic repairs at events by community repair groups all over the world. See the Setup section below for how to get hold of the data.
It is compiled and published by the Open Repair Alliance (ORA) an organisation founded by The Restart Project, a UK based charity.
python3 -m venv ords-tools
cd ords-tools
git init
git remote add origin git@github.com:openrepair/tools.git
git pull origin main
git branch --set-upstream-to=origin/main main
source bin/activate
pip install -r requirements.txt
Grab the latest Aggregated Open Repair Data files, unzip the files into the dat/ords directory.
Copy .env.example
to .env
and edit as necessary.
.env
is in .gitignore, do not add it to this repo.
Natural Language Processing Demystified