Tiny Python Projects - Book Summary
The following post contains a summary of the book titled Tiny Python Projects by Ken Youens-Clark
The following post contains a summary of the book titled Tiny Python Projects by Ken Youens-Clark
What did I learn from going through 2 hours of videos on Pytest ?
pytest the most popular Python package for testingunittesting comes with Python. It is used to test the internals of core python. It is a good solid tool but there are a lot of api calls that one might have to learn.’s are used to represent the number of functions tested in a test fileException and pytest can be used to check whether the right exceptions are occurring in the code.method is a function attached to a classfixtures that helps you incorporate the set up that goes along with testing a functionfixtures should be a part of conftest.pypytest has a built-in fixturespytest has a rich set of ecosystem that gives a set of variety of new fixturesfixtures for setup/reuseAfter working through the examples, I am now much more comfortable in going through the book on pytest. There is no doubt that I will be using all these things in the UOB project implementation
The following post contains a summary of the book titled Python Workout by Reuven M. Lerner
The following are the learnings from Hugging Face Interview in Oct 2019
| |
Here are the top 25 models as of
Watched a fantastic Ted Talk that highlighted the importance of gamifying learning
no penalty means increased attempts and better scoreThe following are my learning from the paper titled, Attention is all you need :

What did I learn from attention primer from Jay Allamar
What did I learn from CodeEmporium about BERT and Transfomers
What did I learn from transformer primer from Jay Allamar
This blog post summarizes the book titled “Information Theory”
The following are my learnings from the book .
The following are the takeaways from the course:
The following are the learnings from the podcast:
The following are the learnings from the podcast:
The following are the learnings from the podcast:
The following are the takeaways from Ian GoodFellow Interview.

Kyle Polich discusses the limitations of Bert Model.
This post summarizes the book, “Indistractable”.
My learnings from this article on parallel processing :
My learnings from attending a 3 hour session on Deep Learning :
The following is an excellent summary of Data Leakage in time series testing.
The following are my learnings from the Meetup talk by Prof Hyndman.
The following are my learnings from the Hyndman-MASE.

Kyle Polich discusses BERT. The following are my takeaways.
The following are my takeaways from the talk, Embedding work in NLP.
The following are some of the takeaways from this medium article.
In this brief post, I would like to pen down my thoughts on two aspects: Heuristics and Non-Intepretability of models.
Let’s look at word embedding matrix. If you take a bunch of words and want to build a learning algorithm, the first task is to convert the text in to a bunch of numbers. The two popular algorithms that have revolutionized the field of NLP are Skipgram method and CBOW method. Both involve learning a lower dimensional representation of the word. The dimensions are not interpretable as the dimensions are not unique. The fact that dimensions are not interpretable did not stop someone from developing fantastic applications. Suppose you are in foreign country and you are lost and want to check with someone the correct way to your destination: You flip open your phone, speak your native language and your phone translates the sentence to a foreign language (text/audio), and use it converse with strangers. The job gets done. Do you really care how the word embedding algo is working ? Not really. So, we don’t need to be hung up in intepretability for all applications. In trading for example, if the strategy makes money, you might not care too much about the interpretability of the strategy.
This post illustrates CNN LSTM Models that can be fit to simple time series data.
This post illustrates the early stopping callback in Keras.
This post illustrates generative LSTM via a plain vanilla LSTM.
This post creates a Bidirectional LSTM and learns a simple pattern in the sequence.
This post has two pop quizzes relating to the output of LSTM.
This post creates a plain vanilla LSTM and learns a simple pattern in the sequence.
This post creates a sequence to sequence LSTM and learns a simple pattern in the sequence.
This post creates a Stacked LSTM and learns a simple pattern in the sequence.
This post attempts to summarize the contents of the book Ultra learning by Scott H Young.
The following are my learnings from Jose course on NLP.
This post gives some of the learnings from the deliberate practice on spacy.
I was able to undersand the basic idea behind RNN after working through an example from the book by Antonio Gulli.
I was puzzled with the way LSTMs were used to do sentiment analysis. Finally the book by Antonio Gulli helped me understand the mechanics of the LSTM.
In this article, I will explain the way you can code a simple RNN that tracks a simple shift in the pattern, i.e a value from a normal distribution.
As compared to previous implementations where we had used OutputProjectionWrapper, this code does away with that component and does it more efficiently
| |
| |
| |
The output from testing validation data is
In this article, I will explain the way you can code a simple RNN that tracks a simple shift in the pattern
| |
| |
| |
The output from testing validation data is
In this article, I will explain the way you can code a simple RNN that tracks cumulative sums of a sequence
| |
| |
| |
The output from testing validation data is
I have been struggling to implement parity of sequence since many weeks. Finally after going through Geron’s book, I am now able to successfully implement an algo that learns the parity of a sequence. This does not use the classification tweak that many apply to solve the parity problem
| |
| |
| |
The output from testing validation data is
The word “Bond” can depend on the context that you come across. If it is a community news letter, the bond could be used in the context of “Walk to Bond”, “Run to Bond”, “Meet to Bond”. Here the word “Bond”, means bonding.
In the context of Financial news, “Bond” could in all probability mean a financial instrument.
If you have a word2vec model, “Bond” is collapsed in to one dimension and hence loses all the context.

The following are the learnings from the podcast:
Need to work on basics of NN and then move on Transfer learning

The following are the learnings from the podcast:

The following are the learnings from the podcast
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

Kyle Polich discusses NER in this podcast. My learnings are

Kyle interviews Zane and Leena about the Endangered Languages Project. My learnings are
In this project, we are trying to speed up the process of language documentation by building a model that produces phonetic transcriptions from audio samples of human languages. The ultimate goal of our project is to develop a model that could be applied to any human language with minimal changes. We will be using around 3-4 hours of partially labeled audio data in an endangered language called Ladin, which we are using as our main training/test data. As of now we have produced some decent results in vowel identifications and are currently working on phoneme segmentation and identification of larger consonant categories.