We recently released an open source project that syncs wikipedia with a vector database : https://github.com/Piazza-tech/Piazza-Updater
We used Verba, Weaviate and Docker for deployment
We'd like to have some feedback on how to continue the project, which data sources would be interesting to vectorize. You can give feedback on our landing page http://piazza.tech Please leave a star !