this post was submitted on 25 Nov 2023
5 points (85.7% liked)

Self-Hosted Main

511 readers
1 users here now

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.

For Example

We welcome posts that include suggestions for good self-hosted alternatives to popular online services, how they are better, or how they give back control of your data. Also include hints and tips for less technical readers.

Useful Lists

founded 1 year ago
MODERATORS
 

So I'm setting up a home server for personal purposes, among which, storing personal documents/files.. accumulated over the years is now my next target. I've already had:

- Nextcloud to upload/sync files from my mobile devices

- Calibre to manage ebooks, magazines

- Jellyfin to manage multimedia files, including photos

I'm looking for a solution to upload/manage documents. Could be my Ids, could be my rental contract, or recovery passphrases for my accounts, etc.

There are a lot that can be found from here:

https://github.com/awesome-selfhosted/awesome-selfhosted#document-management---integrated-library-systems-ils

But I'd love to get real experience and advice. Something that can run with docker and maybe, probably integrate nicely with the rest of the above stuff.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 3 points 11 months ago (1 children)

Nextcloud works great for document management, if you additionally install tesseract OCR and Elasticsearch. Then you can use any smartphone document-scanner (I personally use "swift scan") to add new documents via WebDAV Upload, but I think most of them support WebDAV nowadays. The Nextcloud app even has a document scanner feature built in, but it's not very good.

I have been reading about the features of paperless-ng and I don't see what that software additionally brings to the table that a properly setup nextcloud cannot do. Only that I have Nextcloud anyways and it can do much more than document management and I love to have all aspects of my "personal cloud" in one software tool.

[–] [email protected] 2 points 11 months ago

Tesseract OCR

I am very unclear on what happens with the OCR output and how that is associated with the file and how it os searched?