this post was submitted on 12 Jun 2023
39 points (100.0% liked)

Rust Lang

139 readers
1 users here now

Rules [Developing]

Observe our code of conduct

Constructive criticism only

No endless relitigation

No low-effort content

No memes or image macros

No NSFW Content

founded 1 year ago
MODERATORS
 

Any project in rust you want to chat about.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 3 points 1 year ago (1 children)

I started working on a text analysis crate during grad school. I've gotten some surface-level stuff in there already, but I'd like to flesh it out and add some more true NLP functionality.

Link: https://github.com/michael-long88/rnltk

[–] [email protected] 2 points 1 year ago (1 children)

Does this detect stylistic similarity between texts? I was thinking about porting a similar project that was developed to determine probable authorship. It would be quite useful as an open-source bot detector.

[–] [email protected] 1 points 1 year ago

Nothing quite that advanced. It's mostly just stemming, basic tokenization, TF-IDF, and cosine similarity at this point.