Skip to content

Latest commit

 

History

History

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

README.md

Data Science

108 articles on data science practice — Pandas, NumPy, exploratory analysis, data cleaning, and preprocessing.

Part of the xbe.at knowledge base. ← Back to index


What's inside

Pandas

  • DataFrames: filtering, merging, concatenation, groupby, pivot tables
  • Advanced techniques: many-to-one relationships, pd.concat(), .describe()
  • Performance: avoiding apply(), vectorization with np.where(), Pandas + Dask
  • Missing data handling, duplicate removal, data type management

NumPy

  • Array creation, manipulation, broadcasting
  • Column-wise operations, finding minimums in 2D arrays
  • Choosing between NumPy and Pandas

Data cleaning & preprocessing

  • Handling missing values (multiple strategies)
  • Outlier detection and filtering
  • Encoding: one-hot, label, target encoding
  • Feature scaling, normalization, standardization

Exploratory Data Analysis (EDA)

  • Variable types (11 categories), data summarization with Skimpy
  • Automatic error detection in tabular datasets
  • Confidence and prediction intervals
  • Correlation methods, predictive power score

Applied data science

  • Financial data analysis, user engagement and churn analysis
  • Time series analysis with Python and Statsmodels
  • Interview questions for data science roles
  • Best practices: avoiding bad coding habits, data leakage

Highlights

  • Comprehensive Guide to Python Pandas
  • 10 Underrated Python Packages for Data Science
  • Advanced Handling Missing Data in Python
  • Comprehensive Guide to Target Encoding in Python
  • Avoiding Bad Coding Practices in Data Science