Simulation of applying the random forest algorithm to predict product ratings on an e-commerce platform #32

@rhaitunara

Description

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

Assume your DataFrame is named 'df'

Make sure the 'rating' column contains no text data

df['rating'] = pd.to_numeric(df['rating'], errors='coerce').fillna(0)
df['rating'] = df['rating'].astype(int)
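
Note that `fillna(0)` turns every unparsable value into a rating of 0, which then appears as its own category in the distribution, and `astype(int)` truncates fractional ratings. A minimal sketch of one alternative, assuming invalid rows can simply be discarded. The helper name `clean_ratings` and the sample data are hypothetical:

```python
import pandas as pd

def clean_ratings(df: pd.DataFrame, column: str = "rating") -> pd.DataFrame:
    """Return a copy with `column` as integer ratings, dropping unparsable rows."""
    # Non-numeric strings and missing values become NaN instead of raising
    ratings = pd.to_numeric(df[column], errors="coerce")
    valid = ratings.notna()
    # Keep only rows with a parsable rating (assumption: the rest are safe to discard)
    out = df.loc[valid].copy()
    # Round to the nearest whole star before casting, so 4.6 becomes 5 rather than 4
    out[column] = ratings[valid].round().astype(int)
    return out

# Hypothetical sample data for illustration only
sample = pd.DataFrame({"rating": ["5", "4.6", "N/A", "3", None]})
cleaned = clean_ratings(sample)
print(cleaned["rating"].tolist())
```

Whether to drop, impute, or keep a separate "unknown" category depends on how the ratings will be used as the prediction target later on.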

Compute the rating distribution

rating_counts = df['rating'].value_counts().sort_index()

print("Distribution of values in the 'rating' column:")
print(rating_counts)
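
Since the rating is meant to serve as a prediction target, it may also help to view the counts as percentages, which makes any class imbalance easier to spot. A small sketch, assuming the 'rating' column is already integer-valued. The sample data and the names `rating_share` and `summary` are hypothetical:

```python
import pandas as pd

# Hypothetical sample data standing in for the cleaned 'rating' column
df = pd.DataFrame({"rating": [5, 5, 4, 5, 3, 4, 5, 1]})

# Absolute counts, ordered by rating value
rating_counts = df["rating"].value_counts().sort_index()

# Relative frequencies in percent, in the same order
rating_share = (df["rating"].value_counts(normalize=True).sort_index() * 100).round(1)

# Side-by-side table of counts and percentages, indexed by rating
summary = pd.DataFrame({"count": rating_counts, "percent": rating_share})
print(summary)
```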

Create the visualization (bar chart)

plt.figure(figsize=(10, 6))
sns.countplot(x='rating', data=df, order=rating_counts.index, palette='viridis')
plt.title('Product Rating Distribution', fontsize=16)
plt.xlabel('Product Rating')
plt.ylabel('Number of Reviews')
plt.grid(axis='y', linestyle='--', alpha=0.7)
plt.show()
