Difference between revisions of "Class"

From
Jump to: navigation, search
Line 39: Line 39:
 
Multi-channel LSTM Network
 
Multi-channel LSTM Network
 
Keras wih TensorFlow
 
Keras wih TensorFlow
Utilize the CloVe and FastText Skipgram pretrained embeddings, allows he underlying network to access larger feature space to build complex features on top of.
+
Utilize the GloVe and FastText Skipgram pretrained embeddings, allows he underlying network to access larger feature space to build complex features on top of.
  
 
Can use utilize combinations of various corpus and embedding methods for better performance
 
Can use utilize combinations of various corpus and embedding methods for better performance
Line 46: Line 46:
  
 
Dense layer to project fnal output classification
 
Dense layer to project fnal output classification
 +
 +
Use embedding...
 +
embeddings = transfer learning 
 +
 +
? CNN vs BI-LSTM (RNN) this approach, BI-LSTM does not need a lot of data
 +
 +
Attention mechanism -- translate ... you can look back
 +
                    ... not a fixed vector size

Revision as of 13:01, 23 October 2018

https://courses.nvidia.com/dashboard

Linguistic Concepts

  • conference - anaphors
  • gang of four design
  • null subject
  • recursion

Word Embeddings

  • HMMS, CRF, PGMs
    • CBoW -Bag of Words / ngrams - feature per word/n items
    • 1-hot Sparse input - create a vector the size of the entire vocabulary
  • Stop Words
  • TF-IDF

Word2Vec

Skip-Gram


  • Firth 1957 Distributional Hypothess
  • Word Cloud

Text Classification

Text/Machine Translation (MNT)

Financial News

Tools:

  • Glove
    • dot product
  • FastText
    • Skipgram
    • Continuous bag of words

Multi-channel LSTM Network Keras wih TensorFlow Utilize the GloVe and FastText Skipgram pretrained embeddings, allows he underlying network to access larger feature space to build complex features on top of.

Can use utilize combinations of various corpus and embedding methods for better performance

Bidirectional LSTM network is used o encode sequential information on the embedding layers.

Dense layer to project fnal output classification

Use embedding... embeddings = transfer learning

? CNN vs BI-LSTM (RNN) this approach, BI-LSTM does not need a lot of data

Attention mechanism -- translate ... you can look back

                   ... not a fixed vector size