SpaCy

[http://www.youtube.com/results?search_query=SpaCy YouTube search...]

[http://www.google.com/search?q=SpaCy+deep+machine+learning+ML+artificial+intelligence ...Google search]
* [[Natural Language Processing (NLP)]]
* [http://spacy.io/ spaCy - Industrial-Strength Natural Language Processing]
* [[Feature Exploration/Learning]]
* [[Python]]
  
 
spaCy excels at large-scale information extraction tasks. It is written from the ground up in carefully memory-managed Cython, and independent research has confirmed it to be the fastest syntactic parser in the world. If your application needs to process entire web dumps, spaCy is the library you want to be using.

spaCy v2.0 features new neural models for tagging, parsing, and entity recognition, designed and implemented from scratch specifically for spaCy to give an unmatched balance of speed, size, and accuracy. A novel Bloom embedding strategy with subword features supports huge vocabularies in tiny tables. Convolutional layers with residual connections, layer normalization, and maxout non-linearity give much better efficiency than the standard BiLSTM solution. Finally, the parser and NER use an imitation learning objective to deliver accuracy in line with the latest research systems, even when evaluated on raw text. With these innovations, spaCy v2.0's models are 10× smaller, 20% more accurate, and cheaper to run than the previous generation.
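A minimal usage sketch of the tagging, parsing, and entity-recognition pipeline described above. It assumes the small English model has been installed (python -m spacy download en_core_web_sm); the example text is arbitrary.

<pre>
import spacy

# Load a trained pipeline; en_core_web_sm is the small English model
# (an assumption here - any model with a tagger, parser, and NER works).
nlp = spacy.load("en_core_web_sm")

doc = nlp("Apple is looking at buying a U.K. startup for $1 billion.")

# Token-level output from the tagger and dependency parser.
for token in doc:
    print(token.text, token.pos_, token.dep_, token.head.text)

# Named entities recognised by the NER component.
for ent in doc.ents:
    print(ent.text, ent.label_)

# For large-scale extraction (e.g. web dumps), stream texts through
# nlp.pipe, which processes documents in batches for speed.
for doc in nlp.pipe(["First document.", "Second document."], batch_size=1000):
    pass  # extract whatever your application needs from each doc
</pre>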
 
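The Bloom embedding strategy mentioned above can be illustrated in isolation: each word is hashed with several seeds into a small table, and the selected rows are summed, so an effectively unbounded vocabulary shares a tiny parameter matrix at the cost of rare collisions. The sketch below is illustrative only, not spaCy's implementation (spaCy uses the HashEmbed layer from its machine-learning library Thinc), and the table sizes and hash count are arbitrary.

<pre>
import hashlib
import numpy as np

N_ROWS, DIM, N_HASHES = 1024, 64, 4   # arbitrary illustrative sizes
rng = np.random.default_rng(0)
table = rng.standard_normal((N_ROWS, DIM)).astype("float32")

def bloom_embed(word: str) -> np.ndarray:
    # Hash the word with N_HASHES different seeds; each hash picks one
    # row of the small table. Distinct words may collide on a single
    # row, but rarely on all N_HASHES rows at once, so the summed
    # vector is still almost always unique to the word.
    rows = [
        int(hashlib.md5(f"{seed}:{word}".encode()).hexdigest(), 16) % N_ROWS
        for seed in range(N_HASHES)
    ]
    return table[rows].sum(axis=0)

print(bloom_embed("spacy").shape)   # (64,) - any string gets a vector
</pre>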