Difference between revisions of "Natural Language Tools & Services"

From
Jump to: navigation, search
m
 
(119 intermediate revisions by the same user not shown)
Line 2: Line 2:
 
|title=PRIMO.ai
 
|title=PRIMO.ai
 
|titlemode=append
 
|titlemode=append
|keywords=artificial, intelligence, machine, learning, models, algorithms, data, singularity, moonshot, Tensorflow, Google, Nvidia, Microsoft, Azure, Amazon, AWS  
+
|keywords=ChatGPT, artificial, intelligence, machine, learning, NLP, NLG, NLC, NLU, models, data, singularity, moonshot, Sentience, AGI, Emergence, Moonshot, Explainable, TensorFlow, Google, Nvidia, Microsoft, Azure, Amazon, AWS, Hugging Face, OpenAI, Tensorflow, OpenAI, Google, Nvidia, Microsoft, Azure, Amazon, AWS, Meta, LLM, metaverse, assistants, agents, digital twin, IoT, Transhumanism, Immersive Reality, Generative AI, Conversational AI, Perplexity, Bing, You, Bard, Ernie, prompt Engineering LangChain, Video/Image, Vision, End-to-End Speech, Synthesize Speech, Speech Recognition, Stanford, MIT |description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools
|description=Helpful resources for your journey with artificial intelligence; videos, articles, techniques, courses, profiles, and tools  
+
 
 +
<!-- Google tag (gtag.js) -->
 +
<script async src="https://www.googletagmanager.com/gtag/js?id=G-4GCWLBVJ7T"></script>
 +
<script>
 +
  window.dataLayer = window.dataLayer || [];
 +
  function gtag(){dataLayer.push(arguments);}
 +
  gtag('js', new Date());
 +
 
 +
  gtag('config', 'G-4GCWLBVJ7T');
 +
</script>
 
}}
 
}}
[http://www.youtube.com/results?search_query=tool+services+text+nlp+natural+language+workbench+toolchain+toolbox Youtube search...]
+
[https://www.youtube.com/results?search_query=ai+nlp+natural+language+Processing+tool+services YouTube]
[http://www.google.com/search?q=tool+services+text+nlp+natural+language+~workbench+deep+machine+learning+ML ...Google search]
+
[https://www.quora.com/search?q=ai%20nlp%20natural%20language%20Processing%20tool%20services ... Quora]
 +
[https://www.google.com/search?q=ai+nlp+natural+language+Processing+tool+services ...Google search]
 +
[https://news.google.com/search?q=ai+nlp+natural+language+Processing+tool+services ...Google News]
 +
[https://www.bing.com/news/search?q=ai+nlp+natural+language+Processing+tool+services&qft=interval%3d%228%22 ...Bing News]
  
* [[Natural Language Processing (NLP)]]
+
* [[Natural Language Processing (NLP)]] ... [[Natural Language Generation (NLG)|Generation (NLG)]] ... [[Natural Language Classification (NLC)|Classification (NLC)]] ... [[Natural Language Processing (NLP)#Natural Language Understanding (NLU)|Understanding (NLU)]] ... [[Language Translation|Translation]] ... [[Summarization]] ... [[Sentiment Analysis|Sentiment]] ... [[Natural Language Tools & Services|Tools]]
 +
* [[Large Language Model (LLM)]] ... [[Large Language Model (LLM)#Multimodal|Multimodal]] ... [[Foundation Models (FM)]] ... [[Generative Pre-trained Transformer (GPT)|Generative Pre-trained]] ... [[Transformer]] ... [[Attention]] ... [[Generative Adversarial Network (GAN)|GAN]] ... [[Bidirectional Encoder Representations from Transformers (BERT)|BERT]]
 +
* [[What is Artificial Intelligence (AI)? | Artificial Intelligence (AI)]] ... [[Generative AI]] ... [[Machine Learning (ML)]] ... [[Deep Learning]] ... [[Neural Network]] ... [[Reinforcement Learning (RL)|Reinforcement]] ... [[Learning Techniques]]
 +
* [[Conversational AI]] ... [[ChatGPT]] | [[OpenAI]] ... [[Bing/Copilot]] | [[Microsoft]] ... [[Gemini]] | [[Google]] ... [[Claude]] | [[Anthropic]] ... [[Perplexity]] ... [[You]] ... [[phind]] ... [[Ernie]] | [[Baidu]]
 +
* [[Agents]] ... [[Robotic Process Automation (RPA)|Robotic Process Automation]] ... [[Assistants]] ... [[Personal Companions]] ... [[Personal Productivity|Productivity]] ... [[Email]] ... [[Negotiation]] ... [[LangChain]]
 +
* [[Papers Search]]
 +
* [https://quillbot.com/ QuillBot] ...suite of tools employs cutting-edge AI technology in order to make writing painless
 +
* [http://goose.ai/ Goose.AI]  ...fully managed NLP-as-a-Service, [[Generative Pre-trained Transformer (GPT)]]-based language models delivered via API
 
* [http://en.wikipedia.org/wiki/Outline_of_natural_language_processing#Natural_language_processing_tools NLP Tools | Wikipedia]  
 
* [http://en.wikipedia.org/wiki/Outline_of_natural_language_processing#Natural_language_processing_tools NLP Tools | Wikipedia]  
 +
* [[Data Quality]] ... [[AI Verification and Validation|validity]], [[Evaluation - Measures#Accuracy|accuracy]], [[Data Quality#Data Cleaning|cleaning]], [[Data Quality#Data Completeness|completeness]], [[Data Quality#Data Consistency|consistency]], [[Data Quality#Data Encoding|encoding]], [[Data Quality#Zero Padding|padding]], [[Data Quality#Data Augmentation, Data Labeling, and Auto-Tagging|augmentation, labeling, auto-tagging]], [[Data Quality#Batch Norm(alization) & Standardization| normalization, standardization]], [[Data Quality#Imbalanced Data|imbalanced data]]
 +
* [http://venturebeat.com/2020/07/09/ai-researchers-create-testing-tool-to-find-bugs-in-nlp-from-amazon-google-and-microsoft/ AI researchers create testing tool to find bugs in NLP from Amazon, Google, and Microsoft | Khari Johnson - Venture Beat] ...[http://www.aclweb.org/anthology/2020.acl-main.442/ Beyond Accuracy: Behavioral Testing of NLP Models with CheckList | M. Ribeiro, T. Wu, C. Guestrin, and S. Singh]
 +
* [http://github.com/THUNLP-MT Machine Translation reading list & open-source toolkits | Tsinghua Natural Language Processing Group]
 +
* [http://www.lexalytics.com/lexablog/build-or-buy-natural-language-processing Build or Buy for Natural Language Processing? | Tim Mohler - Lexaytics]
 
* [http://nlp.stanford.edu/links/statnlp.html Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources | Chris Manning - Stanford]
 
* [http://nlp.stanford.edu/links/statnlp.html Statistical natural language processing and corpus-based computational linguistics: An annotated list of resources | Chris Manning - Stanford]
* [http://github.com/NaturalNode/natural NaturalNode]  Node.js
 
 
* [http://cogcomp.org/page/software/ Software Packages | Cognitive Computation Group, led by Prof. Dan Roth]
 
* [http://cogcomp.org/page/software/ Software Packages | Cognitive Computation Group, led by Prof. Dan Roth]
 
* [http://github.com/salesforce/pytorch-qrnn PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM | gitHub]
 
* [http://github.com/salesforce/pytorch-qrnn PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM | gitHub]
 +
* [http://toolfinder.chpc.utah.edu/nlp-tool-finder NLP Tool Finder | NIH  University of Utah  DBMI]
 
* [http://www.kdnuggets.com/2018/12/activewizards-comparison-speech-processing-apis.html API-based Services]
 
* [http://www.kdnuggets.com/2018/12/activewizards-comparison-speech-processing-apis.html API-based Services]
* Capability:
+
* [http://medium.com/@daffl/natural-language-processing-and-machine-learning-in-javascript-249181a3b721 Natural Language Processing and Machine Learning in JavaScript | David Luecke - Medium]
** [[Natural Language Toolkit (NLTK)]]
+
* [http://blog.rapidapi.com/best-nlp-api/ Top 22 NLP (Natural Language Processing) APIs for Developers in 2018 | RapidAPI]
** [[H2O]] Driveless AI
+
* [http://www.sciencedirect.com/science/article/pii/S1532046417301685 Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review | K. Kreimeyera, M. Foster, A. Pandeya, N. Aryaa, G. Halford, S. Jones, R. Forsheea, M. Walderhauga, and T. Botsisa - Journal of Biomedical Informatics] 71 natural language processing systems
** [http://cloud.google.com/natural-language/ Cloud Natural Language (NL) | Google]
+
* [http://www.infoworld.com/article/3519413/8-great-python-libraries-for-natural-language-processing.html 8 great Python libraries for natural language processing | Serdar Yegulalp - InfoWorld]
 +
* [http://www.online-utility.org/text/analyzer.jsp Text Analyzer | Online-Utility.org]
 +
* [[Conversational AI]] ... [[ChatGPT]] | [[OpenAI]] ... [[Bing/Copilot]] | [[Microsoft]] ... [[Gemini]] | [[Google]] ... [[Claude]] | [[Anthropic]] ... [[Perplexity]] ... [[You]] ... [[phind]] ... [[Ernie]] | [[Baidu]]
 +
* [http://blog.google/technology/ai/lamda/  LaMDA (Language Model for Dialogue Applications): our breakthrough conversation technology | ] [[Google]]
 +
** [http://builtin.com/artificial-intelligence/google-lamda-ai-sentient Is Google’s LaMDA AI Truly Sentient? | Ari Moutafis - Builtin]
 +
 
 +
==== Capability with [[JavaScript]] ====
 +
* [[TensorFlow.js]] for training and deploying ML models in the browser and on [[JavaScript#Node.js|Node.js]] (was called Deeplearnjs)
 +
** [http://transcranial.github.io/keras-js/#/ Keras.js] No longer active - capability now is in TensorFlow.js
 +
* [http://www.npmjs.com/package/node-nlp NLP.js] NLP Manager: built on top of several other NLP libraries, including [http://github.com/wooorm/franc Franc] and [http://brain.js.org/#/ Brain.js] providing [[Natural Language Processing (NLP)#Text Classification|Text Classification]], [[Sentiment Analysis]], [[Natural Language Processing (NLP)#Stemming (Morphological Similarity)|Stemming (Morphological Similarity)]], [[Natural Language Processing (NLP)#Named Entity Recognition (NER)|Named Entity Recognition (NER)]], and natural language generation. (nodejs)
 +
* [http://compromise.cool/ Compromise] modest natural-language processing (NLP) interprets and pre-parses English and makes some reasonable decisions
 +
* [http://github.com/NaturalNode/natural Natural] provides [[Natural Language Processing (NLP)#Tokenization / Sentence Splitting|Tokenization / Sentence Splitting]], [[Natural Language Processing (NLP)#Stemming (Morphological Similarity)|Stemming (Morphological Similarity)]] (reducing a word to a not-necessarily morphological root), [[Natural Language Processing (NLP)#Text Classification|Text Classification]], phonetics, [[Term Frequency–Inverse Document Frequency (TF-IDF)]], [http://wordnet.princeton.edu/ WordNet], string [[Natural Language Processing (NLP)#Similarity|Similarity]], some inflections, and more. (nodejs)
 +
 
 +
==== Capability (other) ====
 +
* [http://allenai.org/ Allen Institute for Artificial Intelligence, or AI2]
 +
** [[Semantic Scholar]] ... free, AI-powered research tool for scientific literature
 +
* [[Google]]:
 +
** [[TensorFlow]]
 +
** [[Google Natural Language]] - AutoML Natural Language
 +
** [http://github.com/pair-code/lit Language Interpretability Tool (LIT) | Google - GitHub]
 
** [http://cloud.google.com/speech-to-text/ Cloud Speech-to-Text  | Google]
 
** [http://cloud.google.com/speech-to-text/ Cloud Speech-to-Text  | Google]
 +
** [[Google_Semantic_Reactor]] a [http://www.google.com/sheets/about/ Google Sheets] add-on
 +
* Amazon - AWS:
 +
** [[Textract]] | Amazon in the Elastic Stack Architecture
 
** [http://aws.amazon.com/comprehend/ Comprehend | Amazon]
 
** [http://aws.amazon.com/comprehend/ Comprehend | Amazon]
 
** [http://aws.amazon.com/transcribe/ Transcribe | Amazon]
 
** [http://aws.amazon.com/transcribe/ Transcribe | Amazon]
 
** [http://aws.amazon.com/polly/ Amazon Polly | Amazon]
 
** [http://aws.amazon.com/polly/ Amazon Polly | Amazon]
 +
** [http://aws.amazon.com/kendra/ Kendra | Amazon]
 +
* Microsoft - Asure:
 
** [http://azure.microsoft.com/en-us/services/cognitive-services/ Azure Cognitive Services | Microsoft]
 
** [http://azure.microsoft.com/en-us/services/cognitive-services/ Azure Cognitive Services | Microsoft]
 
** [http://azure.microsoft.com/en-us/services/cognitive-services/speech-services/ Azure Bing Speech API | Microsoft]
 
** [http://azure.microsoft.com/en-us/services/cognitive-services/speech-services/ Azure Bing Speech API | Microsoft]
** [http://www.ibm.com/watson/services/speech-to-text/ Speech to Text | IBM]
+
* Standford:
** [http://www.ibm.com/watson/services/text-to-speech/ Text to Speech | IBM]
+
** [http://stanfordnlp.github.io/CoreNLP/ CoreNLP | Stanford] The Stanford Natural Language Processing Group Toolkit ([[Python]])
** [http://www.programmableweb.com/api/voxsigma VoxSigma API]
+
** [http://stanfordnlp.github.io/stanza/ Stanza - a Python NLP Library for Many Human Languages] ...[http://www.infoq.com/news/2020/03/stanza-nlp-toolkit/ python NLP Toolkit]. Stanza features both a language-agnostic fully neural pipeline for text analysis (supporting 66 human languages), and a python interface to Stanford's CoreNLP java software
** [http://www.twilio.com/speech-recognition Speech Recognition |Twilio]
+
* [[Natural Language Toolkit (NLTK)]] ([[Python]]) implements [[Natural Language Processing (NLP)#Text Classification|Text Classification]], [[Natural Language Processing (NLP)#Tokenization / Sentence Splitting|Tokenization / Sentence Splitting]], [[Natural Language Processing (NLP)#Stemming (Morphological Similarity)|Stemming (Morphological Similarity)]], [[Natural Language Processing (NLP)#Part-of-Speech (POS) Tagging|Part-of-Speech (POS) Tagging]], [[Natural Language Processing (NLP)#Syntax (Parsing)|Syntax (Parsing)]], and semantic reasoning
** [http://www.speechmatics.com/tag/asr/ Automatic Speech Recognition (ASR) | Speechmatics]
+
** [http://textblob.readthedocs.io/en/dev/ TextBlob]([[Python]]) is kind of an extension of [[Natural Language Toolkit (NLTK)| NLTK]]. You can access many of [[Natural Language Toolkit (NLTK)| NLTK]]'s functions in a simplified manner; includes functionality from the Pattern library
** [http://www.nexmo.com/products/voice Voice API | Nexmo]
+
* [[SpaCy]] ([[Python]] and Cython)  everything as an object rather than a string, which simplifies the interface for building applications. Extract useful information from free text with built-in features to assist analysis, such as work tokeniser, named entity recognition, and part-of-speech detection. Spacy supports more than 55 languages.
** [http://wit.ai/ wit.ai]
+
** [http://github.com/chartbeat-labs/textacy textacy] ([[Python]]) focuses primarily on the tasks that come before and follow after [[SpaCy]]
** [http://www.meaningcloud.com/ Meaning Cloud]
+
* [[Python#scikit-learn|scikit-learn]] NLP toolkit
** [http://www.havenondemand.com/ Haven OnDemand]
+
* [http://opennlp.apache.org/ Apache OpenNLP]
** [http://aylien.com/text-analysis-platform/ Aylien]
+
* [http://fasttext.cc/ fastText |] [[Meta|Facebook]]'s AI Research representations and text classifiers ([[Python]])
** [http://www.lexalytics.com/ Lexalytics]
+
* [http://mallet.cs.umass.edu/ MALLET] a Java-based package
** [http://dialogflow.com/ Dialogflow]
+
* [http://www.intel.ai/nlp-architect-by-intel-ai-lab-release-0-2/ Intel NLP Architect] ([[Python]])
** [[SpaCy]] Python Library
+
* [[Gensim]] fast Vector Space Modelling, Topic Modeling, LDA implementation ([[Python]])
** [[Matlab]]
+
* [http://github.com/zalandoresearch/flair flair] use pretrained BERT (PyTorch)
** [[Sintelix]]
+
* [http://allennlp.org/ AllenNLP] an Apache NLP research library (PyTorch)
** [http://www.ibm.com/watson/services/natural-language-understanding/  Watson Natural Language Understanding | IBM]
+
* [http://pytorchnlp.readthedocs.io/en/latest/ Pytorch-NLP] (PyTorch)
** [http://indico.io/ Indico]
+
* [[Matlab]]
** [http://www.textrazor.com/ TextRazor]
+
* [[Sintelix]]
** [http://www.intellexer.com/products.html Intellexer]
+
* [[H2O]] Driveless AI
** [http://www.meaningcloud.com/ Meaning Cloud]
+
* [http://dandelion.eu/ Dandelion API]
** [http://github.com/salesforce/awd-lstm-lm LSTM and QRNN Language Model Toolkit for PyTorch | GitHub]
+
* [http://www.programmableweb.com/api/voxsigma VoxSigma API]
** [http://www.thematically.com Thematically Discover]
+
* [http://www.twilio.com/speech-recognition Speech Recognition |Twilio]
 +
* [http://www.speechmatics.com/tag/asr/ Automatic Speech Recognition (ASR) | Speechmatics]
 +
* [http://www.nexmo.com/products/voice Voice API | Nexmo]
 +
* [http://wit.ai/ wit.ai]
 +
* [http://www.meaningcloud.com/ Meaning Cloud]
 +
* [http://www.havenondemand.com/ Haven OnDemand]
 +
* [http://aylien.com/text-analysis-platform/ Aylien]
 +
* [http://www.lexalytics.com/ Lexalytics]
 +
* [http://dialogflow.com/ Dialogflow]
 +
* [http://indico.io/ Indico]
 +
* [http://www.textrazor.com/ TextRazor]
 +
* [http://www.intellexer.com/products.html Intellexer]
 +
* [http://www.meaningcloud.com/ Meaning Cloud]
 +
* [http://github.com/salesforce/awd-lstm-lm LSTM and QRNN Language Model Toolkit for PyTorch | GitHub]
 +
* [http://www.thematically.com Thematically Discover]
 +
* [http://github.com/clips/pattern Pattern] ([[Python]])
 +
* [http://github.com/aboSamoor/polyglot Polyglot]
 +
* [http://www.clarabridge.com/platform/analytics/ Clarabridge analytics]
 +
* [http://www.ibm.com/watson/services/speech-to-text/ Speech to Text | IBM]
 +
* [http://www.ibm.com/watson/services/text-to-speech/ Text to Speech | IBM]
 +
* [http://www.ibm.com/watson/services/natural-language-understanding/  Watson Natural Language Understanding | IBM]
 +
* [http://www.research.ibm.com/artificial-intelligence/project-debater/ Project Debater | IBM]
  
 +
===== Text Labeling =====
 +
* [http://github.com/dennybritz/bella Bella] open tool aimed at simplifying and speeding up text data labeling. Usually, if a dataset was labeled in a CSV file or Google spreadsheets, specialists need to convert it to an appropriate format before model training. Bella’s features and simple interface make it a good substitution to spreadsheets and CSV files. A graphical user interface (GUI) and a database backend for managing labeled data are Bella’s main features.
 +
* [http://www.tagtog.net/ Tagtog] choose three approaches: annotate text manually, hire a team that will label data for them, or use machine learning models for automated annotation.
 +
* [http://dataturks.com/index.php Dataturks] provides training data preparation tools. Using its products, teams can perform such tasks as parts-of-speech tagging, named-entity recognition tagging, text classification, moderation, and summarization.
 +
* [http://brat.nlplab.org/ Brat] rapid annotation tool] a web-based tool for text annotation; that is, for adding notes to existing text documents, designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and interpreted by a computer.
 +
* [http://github.com/jiesutd/YEDDA Yedda] a lightweight Collaborative Text Span Annotation Tool developed for annotating chunk/entity/event on text (almost all languages including English, [[Government Services#China|Chinese]]), symbol and even emoji.
 +
 +
 +
<youtube>GSgkMoGrHws</youtube>
 +
<youtube>9DE4WV5w6zM</youtube>
 
<youtube>Y2wgQjxrPD8</youtube>
 
<youtube>Y2wgQjxrPD8</youtube>
 
<youtube>DDYS6Yhq1_E</youtube>
 
<youtube>DDYS6Yhq1_E</youtube>
Line 56: Line 134:
 
<youtube>G4XVT3LRxTg</youtube>
 
<youtube>G4XVT3LRxTg</youtube>
 
<youtube>yT58gTXdQb8</youtube>
 
<youtube>yT58gTXdQb8</youtube>
<youtube>9DE4WV5w6zM</youtube>
+
 
  
 
* [[Automated Scoring]]
 
* [[Automated Scoring]]

Latest revision as of 10:40, 28 May 2025

YouTube ... Quora ...Google search ...Google News ...Bing News

Capability with JavaScript

Capability (other)

Text Labeling
  • Bella open tool aimed at simplifying and speeding up text data labeling. Usually, if a dataset was labeled in a CSV file or Google spreadsheets, specialists need to convert it to an appropriate format before model training. Bella’s features and simple interface make it a good substitution to spreadsheets and CSV files. A graphical user interface (GUI) and a database backend for managing labeled data are Bella’s main features.
  • Tagtog choose three approaches: annotate text manually, hire a team that will label data for them, or use machine learning models for automated annotation.
  • Dataturks provides training data preparation tools. Using its products, teams can perform such tasks as parts-of-speech tagging, named-entity recognition tagging, text classification, moderation, and summarization.
  • Brat rapid annotation tool] a web-based tool for text annotation; that is, for adding notes to existing text documents, designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and interpreted by a computer.
  • Yedda a lightweight Collaborative Text Span Annotation Tool developed for annotating chunk/entity/event on text (almost all languages including English, Chinese), symbol and even emoji.