Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[P] For NLP Researchers, Implementation of Text Preprocessing Package, PreNLP

Do very simple text-preprocessing (a.k.a dirty work) with PreNLP Package !

I’m working in NLP part, and implementing a package to do iterative but necessary works for NLP. I want to know what you want to implement on the issue. I’ll implement it on this package.

Here are some exmaples to preprocess text.

from prenlp.data.normalization import * >>> url_normalize('Visit this link for more details: https://github.com/', repl='[URL]') Visit this link for more details: [URL] >>> tag_normalize('Use HTML with the desired attributes: <img src="cat.jpg" height="100" />', repl='[TAG]') Use HTML with the desired attributes: [TAG] >>> emoji_normalize('Hello 🤩, I love you 💓 !', repl='[EMOJI]') Hello [EMOJI], I love you [EMOJI] ! >>> email_normalize('Contact me at lyeoni.g@gmail.com', repl='[EMAIL]') Contact me at [EMAIL] >>> tel_normalize('Call +82 10-1234-5678', repl='[TEL]') Call [TEL] 

LINK: https://github.com/lyeoni/prenlp

submitted by /u/lyeoni
[link] [comments]