An open-source library for data mining and data analysis. It implements a range of machine learning methods: supervised learning (classification, regression, etc.), unsupervised learning (clustering, etc.), and data pre-processing. It is built on the Python numerical libraries NumPy and SciPy, and supports parallel computation.
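The description above does not name the library. As a minimal sketch, the example below assumes it is scikit-learn (import name `sklearn`), which matches this description. It touches each listed capability once: pre-processing, supervised classification, unsupervised clustering, and parallel execution through the `n_jobs` parameter. The choice of dataset, estimators, and parameter values is illustrative only.

```python
# Assumption: the unnamed library is scikit-learn (import name `sklearn`).
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Small bundled dataset: 150 samples, 4 features, 3 classes.
X, y = load_iris(return_X_y=True)

# Pre-processing (standardization) chained with supervised classification.
clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

# Parallel computation: n_jobs=2 evaluates the cross-validation folds
# using two workers.
scores = cross_val_score(clf, X, y, cv=5, n_jobs=2)
mean_accuracy = scores.mean()

# Unsupervised learning: cluster the standardized features into 3 groups.
X_scaled = StandardScaler().fit_transform(X)
labels = KMeans(n_clusters=3, n_init=10, random_state=0).fit_predict(X_scaled)

print(f"mean CV accuracy: {mean_accuracy:.3f}")
print(f"cluster sizes: {[int((labels == k).sum()) for k in range(3)]}")
```

Putting the scaler inside the pipeline means it is refit on each training fold, so the held-out fold does not influence the standardization.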