Overview PySpark is a good python library to perform large-scale exploratory data analysis, create machine learning pipelines and create ETLs for a data platform. If you already have an intermediate level in Python and libraries such as Pandas, then PySpark is an excellent language to learn to create more scalable and relevant analyses and pipelines.… Continue reading Pyspark join Multiple dataframes
Dimension in google analytics This is what characterizes a visitor, for example, the visitor’s geographical origin, the version of his browser. It can also be one of the aspects of the traffic entering the site, its source, its origin. Here is an example list of dimensions on google analytics : Medium Browser Country Language Campaign… Continue reading What is a dimension in google analytics ?
Dominoes Rules There are a multitude of ways to play dominoes! In this article I will show you the classic domino rule as well as the main variations of the game. Rule of classical dominoes games 28 dominoes 2 to 4 players 2 players, 7 dominoes per player with 3 or 4 players, 6 dominoes… Continue reading Dominoes Rules – How to play Dominoes ? Dominoes game
75 best chess quotes : You will find in this article, a digest of the best chess quotes! If you like chess, you will find some rare pearls in this rating! Here we go. 🙂 1. I have come to the personal conclusion that while all artists are not chess players, all chess players are… Continue reading 75 Best Chess Quotes of All time !
Chess pieces names – Chess rules : Chess is a game played less with pieces and more with the mind Introduction Chess pieces names and their moves for beginners : The most important thing to understand while learning to play chess games is the pieces that make up the game. Only after you understand how… Continue reading Chess pieces names – Chess Rules
Pandas drop column : If you work in data science and python, you should be familiar with the python pandas library; Pandas development started in 2008 with lead developer Wes McKinney and the library has become a standard for data analysis and management using Python. Mastering the pandas library is essential for professionals working in… Continue reading Pandas drop column
Python pandas read_csv : Pandas read_csv() method is used to read CSV file (Comma-separated value) into DataFrame object. The CSV format is an open text format representing tabular data as comma-separated values. Pandas module is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.… Continue reading Python pandas read_csv
Pandas get column names : When analyzing large datasets, it may be necessary to obtain column names to perform certain operations on the dataset. In this article, I will show you four ways to retrieve column names in a Pandas dataframe. Let’s start by creating a relatively simple dataset. Now let’s try to get the… Continue reading Pandas get column names
Today we will see how to install jupyter notebook to use python on a windows computer. Jupyter notebooks are electronic notebooks that can gather text, images, mathematical formulas and executable computer code. They can be manipulated interactively in a web browser. Developed for the languages Julia, Python and R (hence the name JUPYTER), it is… Continue reading Jupyter notebook windows
Executing shell commands in python : When working with python, it is sometimes necessary to execute shell/bash codes directly from the python script.In this article, I’m going to show you several methods to do this directly from its python code. The simplest way: Using the OS module The first method to execute a shell command… Continue reading Executing shell commands in python