Hi everyone 👋!
This is Husein Zolkepli. I like to spend my time developing Malaya, a natural language toolkit library for Bahasa Malaysia powered by deep learning with TensorFlow, and publishing more Bahasa Malaysia datasets and corpora through Malay-Dataset.
Full documentation: malaya.readthedocs.io/
Malaya can do a lot of things in fewer than 5 lines of code.
Malaya has also released pretrained Bahasa models; check Malaya/pretrained-model.
Or try them with the Hugging Face 🤗 Transformers library: huggingface.co/models?filter=malay
Malaya's development has been recognized by MDEC and MIGHT as helping, in their words, 'to prepare Malaysia for Industry 4.0'. Modern NLP is all about smart interfacing between humans and machines.
We have spent more than RM100k to release pre-trained and fine-tuned models. If you run a business and use the Malaya library or Malay-Dataset in a revenue-generating product, it would make business sense to sponsor Malaya development. If you do research and have found the Malaya library or Malay-Dataset really helpful, feel free to donate.
This is what I am going to do if I get more support: