Difference between revisions of "Natural Language Tools & Services"

From
Jump to: navigation, search
m
 
(146 intermediate revisions by the same user not shown)
Line 1: Line 1:
 +
{{#seo:
 +
|title=PRIMO.ai
 +
|titlemode=append
 +
|keywords=ChatGPT, artificial, intelligence, machine, learning, NLP, NLG, NLC, NLU, models, data, singularity, moonshot, Sentience, AGI, Emergence, Moonshot, Explainable, TensorFlow, Google, Nvidia, Microsoft, Azure, Amazon, AWS, Hugging Face, OpenAI, Tensorflow, OpenAI, Google, Nvidia, Microsoft, Azure, Amazon, AWS, Meta, LLM, metaverse, assistants, agents, digital twin, IoT, Transhumanism, Immersive Reality, Generative AI, Conversational AI, Perplexity, Bing, You, Bard, Ernie, prompt Engineering LangChain, Video/Image, Vision, End-to-End Speech, Synthesize Speech, Speech Recognition, Stanford, MIT |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools 
 +
 +
<!-- Google tag (gtag.js) -->
 +
<script async src="https://www.googletagmanager.com/gtag/js?id=G-4GCWLBVJ7T"></script>
 +
<script>
 +
  window.dataLayer = window.dataLayer || [];
 +
  function gtag(){dataLayer.push(arguments);}
 +
  gtag('js', new Date());
 +
 +
  gtag('config', 'G-4GCWLBVJ7T');
 +
</script>
 +
}}
 +
[https://www.youtube.com/results?search_query=ai+nlp+natural+language+Processing+tool+services YouTube]
 +
[https://www.quora.com/search?q=ai%20nlp%20natural%20language%20Processing%20tool%20services ... Quora]
 +
[https://www.google.com/search?q=ai+nlp+natural+language+Processing+tool+services ...Google search]
 +
[https://news.google.com/search?q=ai+nlp+natural+language+Processing+tool+services ...Google News]
 +
[https://www.bing.com/news/search?q=ai+nlp+natural+language+Processing+tool+services&qft=interval%3d%228%22 ...Bing News]
 +
 +
* [[Natural Language Processing (NLP)]] ... [[Natural Language Generation (NLG)|Generation (NLG)]] ... [[Natural Language Classification (NLC)|Classification (NLC)]] ... [[Natural Language Processing (NLP)#Natural Language Understanding (NLU)|Understanding (NLU)]] ... [[Language Translation|Translation]] ... [[Summarization]] ... [[Sentiment Analysis|Sentiment]] ... [[Natural Language Tools & Services|Tools]]
 +
* [[Large Language Model (LLM)]] ... [[Large Language Model (LLM)#Multimodal|Multimodal]] ... [[Foundation Models (FM)]] ... [[Generative Pre-trained Transformer (GPT)|Generative Pre-trained]] ... [[Transformer]] ... [[Attention]] ... [[Generative Adversarial Network (GAN)|GAN]] ... [[Bidirectional Encoder Representations from Transformers (BERT)|BERT]]
 +
* [[What is Artificial Intelligence (AI)? | Artificial Intelligence (AI)]] ... [[Generative AI]] ... [[Machine Learning (ML)]] ... [[Deep Learning]] ... [[Neural Network]] ... [[Reinforcement Learning (RL)|Reinforcement]] ... [[Learning Techniques]]
 +
* [[Conversational AI]] ... [[ChatGPT]] | [[OpenAI]] ... [[Bing/Copilot]] | [[Microsoft]] ... [[Gemini]] | [[Google]] ... [[Claude]] | [[Anthropic]] ... [[Perplexity]] ... [[You]] ... [[phind]] ... [[Ernie]] | [[Baidu]]
 +
* [[Agents]] ... [[Robotic Process Automation (RPA)|Robotic Process Automation]] ... [[Assistants]] ... [[Personal Companions]] ... [[Personal Productivity|Productivity]] ... [[Email]] ... [[Negotiation]] ... [[LangChain]]
 +
* [[Papers Search]]
 +
* [https://quillbot.com/ QuillBot] ...suite of tools employs cutting-edge AI technology in order to make writing painless
 +
* [http://goose.ai/ Goose.AI]  ...fully managed NLP-as-a-Service, [[Generative Pre-trained Transformer (GPT)]]-based language models delivered via API
 
* [http://en.wikipedia.org/wiki/Outline_of_natural_language_processing#Natural_language_processing_tools NLP Tools | Wikipedia]  
 
* [http://en.wikipedia.org/wiki/Outline_of_natural_language_processing#Natural_language_processing_tools NLP Tools | Wikipedia]  
 +
* [[Data Quality]] ... [[AI Verification and Validation|validity]], [[Evaluation - Measures#Accuracy|accuracy]], [[Data Quality#Data Cleaning|cleaning]], [[Data Quality#Data Completeness|completeness]], [[Data Quality#Data Consistency|consistency]], [[Data Quality#Data Encoding|encoding]], [[Data Quality#Zero Padding|padding]], [[Data Quality#Data Augmentation, Data Labeling, and Auto-Tagging|augmentation, labeling, auto-tagging]], [[Data Quality#Batch Norm(alization) & Standardization| normalization, standardization]], [[Data Quality#Imbalanced Data|imbalanced data]]
 +
* [http://venturebeat.com/2020/07/09/ai-researchers-create-testing-tool-to-find-bugs-in-nlp-from-amazon-google-and-microsoft/ AI researchers create testing tool to find bugs in NLP from Amazon, Google, and Microsoft | Khari Johnson - Venture Beat] ...[http://www.aclweb.org/anthology/2020.acl-main.442/ Beyond Accuracy: Behavioral Testing of NLP Models with CheckList | M. Ribeiro, T. Wu, C. Guestrin, and S. Singh]
 +
* [http://github.com/THUNLP-MT Machine Translation reading list & open-source toolkits | Tsinghua Natural Language Processing Group]
 +
* [http://www.lexalytics.com/lexablog/build-or-buy-natural-language-processing Build or Buy for Natural Language Processing? | Tim Mohler - Lexaytics]
 +
* [http://nlp.stanford.edu/links/statnlp.html Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources | Chris Manning - Stanford]
 +
* [http://cogcomp.org/page/software/ Software Packages | Cognitive Computation Group, led by Prof. Dan Roth]
 +
* [http://github.com/salesforce/pytorch-qrnn PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM | gitHub]
 +
* [http://toolfinder.chpc.utah.edu/nlp-tool-finder NLP Tool Finder | NIH  University of Utah  DBMI]
 +
* [http://www.kdnuggets.com/2018/12/activewizards-comparison-speech-processing-apis.html API-based Services]
 +
* [http://medium.com/@daffl/natural-language-processing-and-machine-learning-in-javascript-249181a3b721 Natural Language Processing and Machine Learning in JavaScript | David Luecke - Medium]
 +
* [http://blog.rapidapi.com/best-nlp-api/ Top 22 NLP (Natural Language Processing) APIs for Developers in 2018 | RapidAPI]
 +
* [http://www.sciencedirect.com/science/article/pii/S1532046417301685 Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review | K. Kreimeyera, M. Foster, A. Pandeya, N. Aryaa, G. Halford, S. Jones, R. Forsheea, M. Walderhauga, and T. Botsisa - Journal of Biomedical Informatics] 71 natural language processing systems
 +
* [http://www.infoworld.com/article/3519413/8-great-python-libraries-for-natural-language-processing.html 8 great Python libraries for natural language processing | Serdar Yegulalp - InfoWorld]
 +
* [http://www.online-utility.org/text/analyzer.jsp Text Analyzer | Online-Utility.org]
 +
* [[Conversational AI]] ... [[ChatGPT]] | [[OpenAI]] ... [[Bing/Copilot]] | [[Microsoft]] ... [[Gemini]] | [[Google]] ... [[Claude]] | [[Anthropic]] ... [[Perplexity]] ... [[You]] ... [[phind]] ... [[Ernie]] | [[Baidu]]
 +
* [http://blog.google/technology/ai/lamda/  LaMDA (Language Model for Dialogue Applications): our breakthrough conversation technology | ] [[Google]]
 +
** [http://builtin.com/artificial-intelligence/google-lamda-ai-sentient Is Google’s LaMDA AI Truly Sentient? | Ari Moutafis - Builtin]
  
* [http://nlp.stanford.edu/links/statnlp.html Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources | Stanford]
+
==== Capability with [[JavaScript]] ====
* [[Natural Language Toolkit (NLTK)]]
+
* [[TensorFlow.js]] for training and deploying ML models in the browser and on [[JavaScript#Node.js|Node.js]] (was called Deeplearnjs)
* [[SpaCy]] Python Library
+
** [http://transcranial.github.io/keras-js/#/ Keras.js] No longer active - capability now is in TensorFlow.js
* [http://cogcomp.org/page/software/ Software Packages | Cognitive Computation Group, led by Prof. Dan Roth]
+
* [http://www.npmjs.com/package/node-nlp NLP.js] NLP Manager: built on top of several other NLP libraries, including [http://github.com/wooorm/franc Franc] and [http://brain.js.org/#/ Brain.js] providing [[Natural Language Processing (NLP)#Text Classification|Text Classification]], [[Sentiment Analysis]], [[Natural Language Processing (NLP)#Stemming (Morphological Similarity)|Stemming (Morphological Similarity)]], [[Natural Language Processing (NLP)#Named Entity Recognition (NER)|Named Entity Recognition (NER)]], and natural language generation. (nodejs)
 +
* [http://compromise.cool/ Compromise] modest natural-language processing (NLP) interprets and pre-parses English and makes some reasonable decisions
 +
* [http://github.com/NaturalNode/natural Natural] provides [[Natural Language Processing (NLP)#Tokenization / Sentence Splitting|Tokenization / Sentence Splitting]], [[Natural Language Processing (NLP)#Stemming (Morphological Similarity)|Stemming (Morphological Similarity)]] (reducing a word to a not-necessarily morphological root), [[Natural Language Processing (NLP)#Text Classification|Text Classification]], phonetics, [[Term Frequency–Inverse Document Frequency (TF-IDF)]], [http://wordnet.princeton.edu/ WordNet], string [[Natural Language Processing (NLP)#Similarity|Similarity]], some inflections, and more. (nodejs)
 +
 
 +
==== Capability (other) ====
 +
* [http://allenai.org/ Allen Institute for Artificial Intelligence, or AI2]
 +
** [[Semantic Scholar]] ... free, AI-powered research tool for scientific literature
 +
* [[Google]]:
 +
** [[TensorFlow]]
 +
** [[Google Natural Language]] - AutoML Natural Language
 +
** [http://github.com/pair-code/lit Language Interpretability Tool (LIT) | Google - GitHub]
 +
** [http://cloud.google.com/speech-to-text/ Cloud Speech-to-Text  | Google]
 +
** [[Google_Semantic_Reactor]] a [http://www.google.com/sheets/about/ Google Sheets] add-on
 +
* Amazon - AWS:
 +
** [[Textract]] | Amazon in the Elastic Stack Architecture
 +
** [http://aws.amazon.com/comprehend/ Comprehend | Amazon]
 +
** [http://aws.amazon.com/transcribe/ Transcribe | Amazon]
 +
** [http://aws.amazon.com/polly/ Amazon Polly | Amazon]
 +
** [http://aws.amazon.com/kendra/ Kendra | Amazon]
 +
* Microsoft - Asure:
 +
** [http://azure.microsoft.com/en-us/services/cognitive-services/ Azure Cognitive Services | Microsoft]
 +
** [http://azure.microsoft.com/en-us/services/cognitive-services/speech-services/ Azure Bing Speech API | Microsoft]
 +
* Standford:
 +
** [http://stanfordnlp.github.io/CoreNLP/ CoreNLP | Stanford] The Stanford Natural Language Processing Group Toolkit ([[Python]])
 +
** [http://stanfordnlp.github.io/stanza/ Stanza - a Python NLP Library for Many Human Languages] ...[http://www.infoq.com/news/2020/03/stanza-nlp-toolkit/ python NLP Toolkit]. Stanza features both a language-agnostic fully neural pipeline for text analysis (supporting 66 human languages), and a python interface to Stanford's CoreNLP java software
 +
* [[Natural Language Toolkit (NLTK)]] ([[Python]]) implements [[Natural Language Processing (NLP)#Text Classification|Text Classification]], [[Natural Language Processing (NLP)#Tokenization / Sentence Splitting|Tokenization / Sentence Splitting]], [[Natural Language Processing (NLP)#Stemming (Morphological Similarity)|Stemming (Morphological Similarity)]], [[Natural Language Processing (NLP)#Part-of-Speech (POS) Tagging|Part-of-Speech (POS) Tagging]], [[Natural Language Processing (NLP)#Syntax (Parsing)|Syntax (Parsing)]], and semantic reasoning
 +
** [http://textblob.readthedocs.io/en/dev/ TextBlob]([[Python]]) is kind of an extension of [[Natural Language Toolkit (NLTK)| NLTK]]. You can access many of [[Natural Language Toolkit (NLTK)| NLTK]]'s functions in a simplified manner; includes functionality from the Pattern library
 +
* [[SpaCy]] ([[Python]] and Cython)  everything as an object rather than a string, which simplifies the interface for building applications. Extract useful information from free text with built-in features to assist analysis, such as work tokeniser, named entity recognition, and part-of-speech detection. Spacy supports more than 55 languages.
 +
** [http://github.com/chartbeat-labs/textacy textacy] ([[Python]]) focuses primarily on the tasks that come before and follow after [[SpaCy]]
 +
* [[Python#scikit-learn|scikit-learn]] NLP toolkit
 +
* [http://opennlp.apache.org/ Apache OpenNLP]
 +
* [http://fasttext.cc/ fastText |] [[Meta|Facebook]]'s AI Research representations and text classifiers ([[Python]])
 +
* [http://mallet.cs.umass.edu/ MALLET] a Java-based package
 +
* [http://www.intel.ai/nlp-architect-by-intel-ai-lab-release-0-2/ Intel NLP Architect] ([[Python]])
 +
* [[Gensim]] fast Vector Space Modelling, Topic Modeling, LDA implementation ([[Python]])
 +
* [http://github.com/zalandoresearch/flair flair] use pretrained BERT (PyTorch)
 +
* [http://allennlp.org/ AllenNLP] an Apache NLP research library (PyTorch)
 +
* [http://pytorchnlp.readthedocs.io/en/latest/ Pytorch-NLP] (PyTorch)
 +
* [[Matlab]]
 +
* [[Sintelix]]
 +
* [[H2O]] Driveless AI
 +
* [http://dandelion.eu/ Dandelion API]
 +
* [http://www.programmableweb.com/api/voxsigma VoxSigma API]
 +
* [http://www.twilio.com/speech-recognition Speech Recognition |Twilio]
 +
* [http://www.speechmatics.com/tag/asr/ Automatic Speech Recognition (ASR) | Speechmatics]
 +
* [http://www.nexmo.com/products/voice Voice API | Nexmo]
 +
* [http://wit.ai/ wit.ai]
 +
* [http://www.meaningcloud.com/ Meaning Cloud]
 +
* [http://www.havenondemand.com/ Haven OnDemand]
 +
* [http://aylien.com/text-analysis-platform/ Aylien]
 +
* [http://www.lexalytics.com/ Lexalytics]
 +
* [http://dialogflow.com/ Dialogflow]
 +
* [http://indico.io/ Indico]
 +
* [http://www.textrazor.com/ TextRazor]
 +
* [http://www.intellexer.com/products.html Intellexer]
 +
* [http://www.meaningcloud.com/ Meaning Cloud]
 +
* [http://github.com/salesforce/awd-lstm-lm LSTM and QRNN Language Model Toolkit for PyTorch | GitHub]
 +
* [http://www.thematically.com Thematically Discover]
 +
* [http://github.com/clips/pattern Pattern] ([[Python]])
 +
* [http://github.com/aboSamoor/polyglot Polyglot]
 +
* [http://www.clarabridge.com/platform/analytics/ Clarabridge analytics]
 +
* [http://www.ibm.com/watson/services/speech-to-text/ Speech to Text | IBM]
 +
* [http://www.ibm.com/watson/services/text-to-speech/ Text to Speech | IBM]
 +
* [http://www.ibm.com/watson/services/natural-language-understanding/  Watson Natural Language Understanding | IBM]
 +
* [http://www.research.ibm.com/artificial-intelligence/project-debater/ Project Debater | IBM]
 +
 
 +
===== Text Labeling =====
 +
* [http://github.com/dennybritz/bella Bella] open tool aimed at simplifying and speeding up text data labeling. Usually, if a dataset was labeled in a CSV file or Google spreadsheets, specialists need to convert it to an appropriate format before model training. Bella’s features and simple interface make it a good substitution to spreadsheets and CSV files. A graphical user interface (GUI) and a database backend for managing labeled data are Bella’s main features.
 +
* [http://www.tagtog.net/ Tagtog] choose three approaches: annotate text manually, hire a team that will label data for them, or use machine learning models for automated annotation.
 +
* [http://dataturks.com/index.php Dataturks] provides training data preparation tools. Using its products, teams can perform such tasks as parts-of-speech tagging, named-entity recognition tagging, text classification, moderation, and summarization.
 +
* [http://brat.nlplab.org/ Brat] rapid annotation tool] a web-based tool for text annotation; that is, for adding notes to existing text documents, designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and interpreted by a computer.
 +
* [http://github.com/jiesutd/YEDDA Yedda] a lightweight Collaborative Text Span Annotation Tool developed for annotating chunk/entity/event on text (almost all languages including English, [[Government Services#China|Chinese]]), symbol and even emoji.
 +
 
 +
 
 +
<youtube>GSgkMoGrHws</youtube>
 +
<youtube>9DE4WV5w6zM</youtube>
 +
<youtube>Y2wgQjxrPD8</youtube>
 +
<youtube>DDYS6Yhq1_E</youtube>
 +
<youtube>mVNHYdjeLAY</youtube>
 +
<youtube>T63LDIRbwxw</youtube>
 +
<youtube>J5IlKj7H8T8</youtube>
 +
<youtube>WDItryHZ3JI</youtube>
 +
<youtube>G4XVT3LRxTg</youtube>
 +
<youtube>yT58gTXdQb8</youtube>
 +
 
 +
 
 +
* [[Automated Scoring]]

Latest revision as of 10:40, 28 May 2025

YouTube ... Quora ...Google search ...Google News ...Bing News

Capability with JavaScript

Capability (other)

Text Labeling
  • Bella open tool aimed at simplifying and speeding up text data labeling. Usually, if a dataset was labeled in a CSV file or Google spreadsheets, specialists need to convert it to an appropriate format before model training. Bella’s features and simple interface make it a good substitution to spreadsheets and CSV files. A graphical user interface (GUI) and a database backend for managing labeled data are Bella’s main features.
  • Tagtog choose three approaches: annotate text manually, hire a team that will label data for them, or use machine learning models for automated annotation.
  • Dataturks provides training data preparation tools. Using its products, teams can perform such tasks as parts-of-speech tagging, named-entity recognition tagging, text classification, moderation, and summarization.
  • Brat rapid annotation tool] a web-based tool for text annotation; that is, for adding notes to existing text documents, designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and interpreted by a computer.
  • Yedda a lightweight Collaborative Text Span Annotation Tool developed for annotating chunk/entity/event on text (almost all languages including English, Chinese), symbol and even emoji.