The Gradient

Sequence GANs in a Nutshell

Posted on 2020-08-30 In NLP, NLG, GAN

Background: Conventional maximum likelihood training with teacher forcing is inherently prone to exposure bias at inference time. During training the generator conditions on ground-truth prefixes, but at inference it produces each token conditioned on its own previous predictions, which may never have been observed during training. This training-inference discrepancy causes errors to accumulate as the generated sequence grows longer. In other words, the model is trained only on demonstrated behavior (real data samples), not in free-running mode.
Generative Adversarial Networks (GANs) hold the promise of mitigating such issues in discrete sequence generation tasks such as language modeling and speech/music generation.
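
To make the exposure bias described above concrete, here is a minimal toy sketch. The "model" is a hypothetical lookup table from the previous token to the predicted next token (not a real sequence model), and the function names are illustrative assumptions. With one deliberate mistake in the table, teacher forcing yields a single wrong token, while free-running generation drifts off the training data and never recovers.

```python
# Toy illustration of exposure bias; the lookup-table "model" is an assumption
# made for this sketch, standing in for a trained next-token predictor.

def next_token(model, prev):
    # Predict the next token given only the previous one.
    return model.get(prev, "<unk>")

def teacher_forced(model, target):
    # Every step is conditioned on the ground-truth previous token.
    return [next_token(model, prev) for prev in target[:-1]]

def free_running(model, start, steps):
    # Every step is conditioned on the model's own previous prediction.
    out, prev = [], start
    for _ in range(steps):
        prev = next_token(model, prev)
        out.append(prev)
    return out

target = ["<s>", "the", "cat", "sat", "down"]
# The table has one mistake: it predicts "dog" after "the" instead of "cat".
model = {"<s>": "the", "the": "dog", "cat": "sat", "sat": "down"}

tf_output = teacher_forced(model, target)
fr_output = free_running(model, "<s>", len(target) - 1)

tf_errors = sum(p != t for p, t in zip(tf_output, target[1:]))
fr_errors = sum(p != t for p, t in zip(fr_output, target[1:]))
print(tf_output, tf_errors)
print(fr_output, fr_errors)
```

Under teacher forcing the mistake stays local because the next step is fed the correct token again. In free-running mode the wrong token becomes the next input, a context the table never saw, so the errors compound.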

Automatic Evaluation Metrics for Language Generation

Posted on 2020-06-05 In NLP, NLG, NLG Evaluation

A summary of automatic evaluation metrics for natural language generation (NLG) applications.

Human evaluation considers adequacy, fidelity, and fluency, but it is quite expensive.

Adequacy: Does the output convey the same meaning as the input sentence? Is part of the message lost, added, or distorted?
Fluency: Is the output good, fluent English? This involves both grammatical correctness and idiomatic word choice.

Thus, a useful automatic evaluation metric holds great promise for NLG applications such as machine translation, text summarization, image captioning, dialogue generation, and poetry/story generation.

Shell Command Notes

Posted on 2020-05-12 In Shell

A cheat sheet of helpful Bash commands.

Image Captioning: A Summary!

Posted on 2020-05-01 In Vision & Language, Image Captioning

A summary of image-to-text translation.

An Introduction to Capsules

Posted on 2020-04-23 In NN, Capsule

A capsule is a group of neurons whose activity vector represents the instantiation parameters of a specific type of entity. Here is a brief note on capsule networks^[1]^[2].

Decoding in Text Generation

Posted on 2020-04-21 In NLP, Conditional LM, Decoding

A summary of common decoding strategies in language generation.

Sparse Matrix in Data Processing

Posted on 2020-04-03 In Programming practical, Numerical computation

It is wasteful to store zero elements in a sparse matrix, especially for incrementally growing data. When constructing tf-idf and bag-of-words features or saving a graph adjacency matrix, inefficient sparse matrix storage can lead to memory errors. To circumvent this problem, an efficient sparse matrix storage format is a good choice.
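
As a rough illustration of the storage saving, the sketch below converts a dense matrix into the compressed sparse row (CSR) layout using plain Python lists. The helper names `dense_to_csr` and `csr_row` are hypothetical and written only for this example; in practice a library such as `scipy.sparse` (for instance `csr_matrix`) would normally be used instead.

```python
# Hand-rolled CSR sketch; helper names are assumptions made for illustration.

def dense_to_csr(rows):
    # data:    the nonzero values, row by row
    # indices: the column index of each nonzero value
    # indptr:  indptr[i]..indptr[i+1] delimits row i inside data/indices
    data, indices, indptr = [], [], [0]
    for row in rows:
        for col, value in enumerate(row):
            if value != 0:
                data.append(value)
                indices.append(col)
        indptr.append(len(data))
    return data, indices, indptr

def csr_row(data, indices, indptr, i, n_cols):
    # Rebuild dense row i from the CSR arrays.
    row = [0] * n_cols
    for k in range(indptr[i], indptr[i + 1]):
        row[indices[k]] = data[k]
    return row

dense = [
    [0, 0, 3, 0],
    [0, 0, 0, 0],
    [1, 0, 0, 2],
]
data, indices, indptr = dense_to_csr(dense)
restored = [csr_row(data, indices, indptr, i, 4) for i in range(len(dense))]
print(data, indices, indptr)
```

Only the three nonzero values and their positions are stored, plus one row pointer per row, so memory grows with the number of nonzeros rather than with rows times columns.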

Clustering Methods: A Note

Posted on 2020-03-20 In ML, Clustering

Notes on clustering approaches.

An Introduction to Graph Neural Networks

Posted on 2020-03-16 In Graph Neural Networks

Graph Neural Networks (GNNs) have demonstrated efficacy on non-Euclidean data, such as data from social media and bioinformatics.

Image source: ^[1]

Generative Adversarial Networks

Posted on 2020-01-15 In Unsupervised learning, GAN

GANs are widely used to estimate generative models without any explicit density function. They instead take a game-theoretic approach: learning to generate samples from the training distribution via a two-player game.
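
For reference, the two-player game is usually written as the minimax objective from the original GAN formulation. The symbols here are the standard ones and are not defined in this excerpt: $G$ is the generator, $D$ the discriminator, $p_{\text{data}}$ the training distribution, and $p_z$ a noise prior.

```latex
\min_G \max_D V(D, G)
  = \mathbb{E}_{x \sim p_{\text{data}}(x)}\big[\log D(x)\big]
  + \mathbb{E}_{z \sim p_z(z)}\big[\log\big(1 - D(G(z))\big)\big]
```

The discriminator maximizes $V$ by telling real samples from generated ones, while the generator minimizes it by producing samples that the discriminator accepts as real.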