14- Data Mining / DBSCAN and Spectral Clustering

Institution: Pontifical Catholic University of São Paulo (PUC-SP)
School: Faculty of Interdisciplinary Studies
Program: Humanistic AI and Data Science Semester: 2nd Semester 2025
Professor: Professor Doctor in Mathematics Daniel Rodrigues da Silva

🎶 Prelude Suite no.1 (J. S. Bach) - Sound Design Remix

Statistical.Measures.and.Banking.Sector.Analysis.at.Bovespa.mp4

📺 For better resolution, watch the video on YouTube.

Tip

This repository is a review of the Statistics course from the undergraduate program Humanities, AI and Data Science at PUC-SP.

☞ Access Data Mining Main Repository

Important

⚠️ Heads Up

Projects and deliverables may be made publicly available whenever possible.
The course emphasizes practical, hands-on experience with real datasets to simulate professional consulting scenarios in the fields of Data Analysis and Data Mining for partner organizations and institutions affiliated with the university.
All activities comply with the academic and ethical guidelines of PUC-SP.
Any content not authorized for public disclosure will remain confidential and securely stored in private repositories.

Welcome to your repository guide for DataMining DBSCAN_and_Spectral Clustering. This Repo is written so anyone even kids, can understand the two powerful clustering algorithms: DBSCAN and Spectral Clustering.

What is Clustering ?

Clustering is a way for computers to group things that are similar—like organizing marbles by color, or animals by species. The computer looks for natural groups in the data, so points in the same group are more like each other than points in other groups. Some points might not fit anywhere; finding them is important too!

DBSCAN Algorithm

DBSCAN stands for "Density-Based Spatial Clustering of Applications with Noise." It helps find groups in data where points are close together, based on how many neighbors each point has.

How DBSCAN Works (Step-by-Step)

Pick any point not yet checked.
Draw a circle around it: The size the circle (called epsilon, $ \varepsilon $) says what counts as "close."
Count all the neighbors inside the circle.
- If enough neighbors (at least MinPts), this is a core point—start a new group!
- If not enough: Maybe a border point or "noise."
Grow the group: For each direct neighbor that is a core point, include their neighbors too—so the group grows!
Repeat: Until every point is grouped or marked as noise.

Core, Border, and Noise Points

Core point: Has lots of friends (enough neighbors within $ \varepsilon $).
Border point: Doesn't have enough direct neighbors, but is close to a core point.
Noise: Too far from any busy area. Not in a group at all!

Bibliography

1 Abdi, H. & WilliamsC, L.J. Principal Component Analysis. Wiley Interdisciplinary Reviews, 2010.

2. Castro, L. N. & Ferrari, D. G. (2016). Introdução à mineração de dados: conceitos básicos, algoritmos e aplicações. Saraiva.

3. Dunteman, J. Principal Component Analysis. SAGE Publications, 1989.

4. Ferreira, A. C. P. L. et al. (2024). Inteligência Artificial - Uma Abordagem de Aprendizado de Máquina. 2nd Ed. LTC.

5. Larson & Farber (2015). Estatística Aplicada. Pearson.

6. Liu, F.T. et al. Isolation Forest. IEEE ICDM, 2008.

💌 Let the data flow... Ping Me !

🛸๋ My Contacts Hub

────────────── 🔭⋆ ──────────────

➣➢➤ Back to Top

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
DBSCAN and Spectral Clustering with Monkey Dataset		DBSCAN and Spectral Clustering with Monkey Dataset
DBSCAN with Spiral Dataset		DBSCAN with Spiral Dataset
Workbook		Workbook
dataset _monkey		dataset _monkey
dataset_spiral		dataset_spiral
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Repository files navigation

14- Data Mining / DBSCAN and Spectral Clustering

🎶 Prelude Suite no.1 (J. S. Bach) - Sound Design Remix

📺 For better resolution, watch the video on YouTube.

☞ Access Data Mining Main Repository

Welcome to your repository guide for DataMining DBSCAN_and_Spectral Clustering. This Repo is written so anyone even kids, can understand the two powerful clustering algorithms: DBSCAN and Spectral Clustering.

Table of Contents

What is Clustering ?

DBSCAN Algorithm

How DBSCAN Works (Step-by-Step)

Core, Border, and Noise Points

Bibliography

💌 Let the data flow... Ping Me !

🛸๋ My Contacts Hub

Copyright 2025 Quantum Software Development. Code released under the MIT License license.

About

Uh oh!

Sponsor this project

Uh oh!

Uh oh!

Languages

Uh oh!

License

Quantum-Software-Development/14-DataMining_DBSCAN_and_Spectral-Clustering

Folders and files

Latest commit

History

Repository files navigation

14- Data Mining / DBSCAN and Spectral Clustering

🎶 Prelude Suite no.1 (J. S. Bach) - Sound Design Remix

📺 For better resolution, watch the video on YouTube.

☞ Access Data Mining Main Repository

Welcome to your repository guide for DataMining DBSCAN_and_Spectral Clustering. This Repo is written so anyone even kids, can understand the two powerful clustering algorithms: DBSCAN and Spectral Clustering.

Table of Contents

What is Clustering ?

DBSCAN Algorithm

How DBSCAN Works (Step-by-Step)

Core, Border, and Noise Points

Bibliography

💌 Let the data flow... Ping Me !

🛸๋ My Contacts Hub

Copyright 2025 Quantum Software Development. Code released under the MIT License license.

About

Topics

Resources

License

Code of conduct

Security policy

Uh oh!

Stars

Watchers

Forks

Sponsor this project

Uh oh!

Uh oh!

Languages