Collection of all my blog articles

Investigating the iris dataset

Because the iris dataset does not contain any missing values or other irregularities. We can immediately jump to further preprocessing steps. For the full code see ....

Data preprocessing

We will first split our training set into test and train.

data(iris)
set....

In this post we will look at the paper from M. S. Waterman and R. M. Idury titled "A new algorithm for DNA sequence assembly" which is one of the foundational papers for the usage of deBruijn graphs in genome assembly. We will also try to verify the proposed formulas with own experimental data gener...