Hi Jitendra
Recently i was reading about how to filter profanity.
Some good solutions that i found out were:
1. http://www.purgomalum.com/
//PurgoMalum is a simple, free, RESTful web service for filtering and removing content of profanity, obscenity and other unwanted text.
2. http://wiki.cdyne.com/wiki/index.php?title=Profanity_Filter
// FREE Profanity Filter API is a simple [CDYNE is no longer maintaining this product.]
3.https://www.webpurify.com/
Algorithms
1. Scunthorpe problem:- https://en.wikipedia.org/wiki/Scunthorpe_problem
2. Metaphone :- https://en.wikipedia.org/wiki/Metaphone
3. Aho–Corasick algorithm:- https://en.wikipedia.org/wiki/Aho%E2%80%93Corasick_algorithm
4. Wordfilter :- https://en.wikipedia.org/wiki/Wordfilter
OpenSource
1. http://www.unknownerror.org/opensource/boothj5/profanity/q/stackoverflow/4985124/profanity-filter-java
Language modeling toolkits are useful for rectifying poorly formatted text. language modeling uses n-gram approach. Some of the popular language modeling tools are
- http://www.speech.cs.cmu.edu/SLM
- http://www-speech.sri.com/projec
- http://alias-i.com/lingpipe/demo
I would say implement your own logic based on the algorithms mentioned, if time is problem them go for Web based solution as they provide NLP based solutions and least preferable option would be to use word filter.
And yes, Link:- http://blogs.adobe.com/contentmanagement/tag/profanity/ is also a solution.
Thanks and Regards
Kautuk Sahni