Leer Hoog-dimensionale Gegevens en de Vloek van Dimensionaliteit | Introductie tot Dimensionaliteitsreductie

Hoog-dimensionale data bevat veel kenmerken of kolommen. Naarmate je meer dimensies toevoegt, verspreiden de datapunten zich verder uit elkaar en wordt de ruimte steeds leger. Dit maakt het moeilijk om patronen te vinden, omdat de afstanden tussen punten hun betekenis verliezen. Dit wordt de vloek van dimensionaliteit genoemd—de uitdaging om data te analyseren wanneer er te veel kenmerken zijn.


              1234567891011121314151617181920212223242526272829
            
import numpy as np
import matplotlib.pyplot as plt

# Generate random points in 2D
np.random.seed(0)
points_2d = np.random.rand(100, 2)

# Generate random points in 3D
points_3d = np.random.rand(100, 3)

fig = plt.figure(figsize=(12, 5))

# Plot 2D points
ax1 = fig.add_subplot(1, 2, 1)
ax1.scatter(points_2d[:, 0], points_2d[:, 1], color='blue', alpha=0.6)
ax1.set_title('100 Random Points in 2D')
ax1.set_xlabel('X')
ax1.set_ylabel('Y')

# Plot 3D points
ax2 = fig.add_subplot(1, 2, 2, projection='3d')
ax2.scatter(points_3d[:, 0], points_3d[:, 1], points_3d[:, 2], color='red', alpha=0.6)
ax2.set_title('100 Random Points in 3D')
ax2.set_xlabel('X')
ax2.set_ylabel('Y')
ax2.set_zlabel('Z')

plt.tight_layout()
plt.show()

Was alles duidelijk?

Bedankt voor je feedback!

Sectie 1. Hoofdstuk 2

Vraag AI

Vraag wat u wilt of probeer een van de voorgestelde vragen om onze chat te starten.

Veeg om het menu te tonen


              1234567891011121314151617181920212223242526272829
            
import numpy as np
import matplotlib.pyplot as plt

# Generate random points in 2D
np.random.seed(0)
points_2d = np.random.rand(100, 2)

# Generate random points in 3D
points_3d = np.random.rand(100, 3)

fig = plt.figure(figsize=(12, 5))

# Plot 2D points
ax1 = fig.add_subplot(1, 2, 1)
ax1.scatter(points_2d[:, 0], points_2d[:, 1], color='blue', alpha=0.6)
ax1.set_title('100 Random Points in 2D')
ax1.set_xlabel('X')
ax1.set_ylabel('Y')

# Plot 3D points
ax2 = fig.add_subplot(1, 2, 2, projection='3d')
ax2.scatter(points_3d[:, 0], points_3d[:, 1], points_3d[:, 2], color='red', alpha=0.6)
ax2.set_title('100 Random Points in 3D')
ax2.set_xlabel('X')
ax2.set_ylabel('Y')
ax2.set_zlabel('Z')

plt.tight_layout()
plt.show()

Was alles duidelijk?

Bedankt voor je feedback!

Sectie 1. Hoofdstuk 2