108 articles on data science practice — Pandas, NumPy, exploratory analysis, data cleaning, and preprocessing.
Part of the xbe.at knowledge base. ← Back to index
- DataFrames: filtering, merging, concatenation, groupby, pivot tables
- Advanced techniques: many-to-one relationships,
pd.concat(),.describe() - Performance: avoiding
apply(), vectorization withnp.where(), Pandas + Dask - Missing data handling, duplicate removal, data type management
- Array creation, manipulation, broadcasting
- Column-wise operations, finding minimums in 2D arrays
- Choosing between NumPy and Pandas
- Handling missing values (multiple strategies)
- Outlier detection and filtering
- Encoding: one-hot, label, target encoding
- Feature scaling, normalization, standardization
- Variable types (11 categories), data summarization with Skimpy
- Automatic error detection in tabular datasets
- Confidence and prediction intervals
- Correlation methods, predictive power score
- Financial data analysis, user engagement and churn analysis
- Time series analysis with Python and Statsmodels
- Interview questions for data science roles
- Best practices: avoiding bad coding habits, data leakage
- Comprehensive Guide to Python Pandas
- 10 Underrated Python Packages for Data Science
- Advanced Handling Missing Data in Python
- Comprehensive Guide to Target Encoding in Python
- Avoiding Bad Coding Practices in Data Science