Categories
Blog

Basics of NumPy: Part 1

The universe of machine learning and data science is a fascinating one. On the surface, it may seem as though one is inundated with data in various forms – be it text, image, or voice, however, if dealt with properly, it makes for not just a great learning experience, but a rather enjoyable one too! […]

Categories
Blog

Multiclass Text Classification using LSTM in Pytorch

Predicting item ratings based on customer reviews Human language is filled with ambiguity, many-a-times the same phrase can have multiple interpretations based on the context and can even appear confusing to humans. Such challenges make natural language processing an interesting but hard problem to solve. However, we’ve seen a lot of advancement in NLP in the […]

Categories
Blog

Image Processing Techniques for Computer Vision

Image Processing is an integral part of Computer vision. We almost always want to resize images, do data augmentation, see images in a grid, etc. OpenCV (Open source computer vision), scikit-image, Pillow are some popular image processing libraries in Python. In this article, I’ve covered some of the most commonly used Image processing techniques. Here’s […]

Categories
Blog

Exploratory Data Analysis (EDA) —  Understanding the Gender Divide in Data Science Roles

with Shreejaya Bharathan on 2018 Kaggle ML & DS Survey data Women have been historically underrepresented in STEM fields and face discrimination in the workplace. According to a study conducted in 2018, “63 percent of the time, women receive lower salary offers than men for the same job at the same company.’’ Does the Data […]

Categories
Blog

Building a movie genre classifier using a dataset created using Google Images

using fast-ai ‘Google Images’ is a great source to find relevant images while constructing a database for a classification problem. Let’s take the problem of classifying movie posters based on their genre. We’re going to take three classes that have the least overlap: romance, horror, and superhero. Creating the Dataset Getting a list of URLs: The first […]