Browsing All Posts published on »April, 2011«

Customizing scikits.learn for a specific text analysis task

April 29, 2011

2

Scikits.learn is a great general library, but machine learning has so many different applications that it is often very helpful to be able to extend its API to better integrate with your code. With scikits.learn, this is extremely easy to do using inheritance and the pipeline module. The problem: While continuing the morphophonetic analysis […]

An overview of dictionary learning: Terminology

April 15, 2011

2

My GSoC proposal is titled “Dictionary learning in scikits.learn”. In the project, I plan to implement methods used in state-of-the-art research and industry applications in signal and image processing. In this post, I want to clarify the terminology used. Usually the terms dictionary learning and sparse coding are used interchangeably. Also […]

Newton interpolation and numerical differentiation

April 15, 2011

1

I am sharing some Python code that I wrote as a school assignment. It computes the Newton form of the interpolation polynomial for a given set of points, and allows both the polynomial and its derivative to be evaluated at a given point. This is an accurate way of estimating the derivative of a […]
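As a rough illustration of the idea in this excerpt (not the code from the original post), here is a minimal sketch of Newton interpolation with divided differences. The function names `divided_differences` and `newton_eval` are hypothetical, and the sketch assumes the sample points have distinct x values.

```python
def divided_differences(xs, ys):
    """Return the Newton coefficients c[k] = f[x_0, ..., x_k]."""
    coef = [float(y) for y in ys]
    n = len(xs)
    for level in range(1, n):
        # Sweep from the end so the lower-order entries are still intact.
        for i in range(n - 1, level - 1, -1):
            coef[i] = (coef[i] - coef[i - 1]) / (xs[i] - xs[i - level])
    return coef


def newton_eval(xs, coef, t):
    """Evaluate the Newton-form polynomial and its derivative at t.

    Uses a Horner-like nested scheme: the derivative recurrence is
    obtained by differentiating each nested multiplication step.
    """
    value = coef[-1]
    deriv = 0.0
    for k in range(len(coef) - 2, -1, -1):
        deriv = deriv * (t - xs[k]) + value
        value = value * (t - xs[k]) + coef[k]
    return value, deriv


# Sample f(x) = x**3 - 2*x + 1 at four points; a cubic is recovered exactly.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [x**3 - 2 * x + 1 for x in xs]
coef = divided_differences(xs, ys)
value, deriv = newton_eval(xs, coef, 1.5)
```

Evaluating the value and the derivative in the same loop costs only one extra multiplication and addition per coefficient, which is why the nested form is convenient here.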

A look at Romanian verbs with scikits-learn

April 14, 2011

4

One of the problems we tackled here at my university is one as old as the modern Romanian language. It is a problem for linguists, as well as for foreigners trying to learn the language. We call it the root alternations problem. Similar to French and other languages, Romanian verbs are split into four groups […]

Tweaking matplotlib subplots for pretty results

April 4, 2011

1

When plotting multiple subplots using matplotlib, the axes rarely look pretty with the default configuration. Since matplotlib figures are abstract objects, designed for consistency in print as well as on screen, tweaking their layout can get tricky. An example: The following code is taken from the face recognition example in scikits.learn: pl.figure(figsize=(1.8 * n_col, 2.4 […]

On setuptools subpackages

April 4, 2011

1

Today, I spent more than two hours trying to figure out why, despite things working out fine in my development scikits.learn folder, python setup.py install would completely ignore the module I refactored into a subpackage. I imagined that simply adding it to the parent __init__.py __all__ attribute would do; I kind of thought that setuptools […]

My first scikits.learn coding sprint

April 2, 2011

0

The fifth scikits.learn coding sprint took place on Friday, April 1st, 2011. For anyone who is not familiar with it, scikits.learn is a fast and easy-to-use machine learning toolkit for the pylab environment (Python, NumPy, SciPy, Matplotlib). This was a good opportunity for me to get code reviews by the developers in order to bring […]