Cloudy1225

Cloudy1225 / PyAGC

Public

Attributed Graph Clustering Library for PyTorch

23
5
100% credibility
Found Feb 10, 2026 at 19 stars -- GitGems finds repos before they trend. Get early access to the next one.
Sign Up Free
AI Analysis
Python
AI Summary

PyAGC is a user-friendly Python library that implements and benchmarks advanced methods for grouping similar nodes in networks with features, scaling to massive sizes.

How It Works

1
🔍 Discover PyAGC

You hear about PyAGC, a helpful toolkit for grouping similar items in networks like friends or products.

2
📥 Get it ready

Download and set up the toolkit on your computer in moments.

3
🕸️ Load your network

Bring in your network data, whether small friends list or huge product catalog.

4
🎯 Choose grouping style

Pick from smart ways to group items based on connections and traits.

5
Watch groups form

Hit start and see your network magically split into clear, meaningful groups.

6
📊 Review insights

Check quality scores, timings, and comparisons to understand your groups.

🎉 Groups ready!

Celebrate having tidy communities for better decisions and discoveries.

Sign up to see the full architecture

5 more

Sign Up Free

Star Growth

See how this repo grew from 19 to 23 stars Sign Up Free
Repurpose This Repo

Repurpose is a Pro feature

Generate ready-to-use prompts for X threads, LinkedIn posts, blog posts, YouTube scripts, and more -- with full repo context baked in.

Unlock Repurpose
AI-Generated Review

What is PyAGC?

PyAGC is a Python library built on PyTorch and PyTorch Geometric for attributed graph clustering, unifying over 20 state-of-the-art methods like deep attentional embedding approaches and adaptive graph convolution under a simple encode-cluster-optimize framework. It handles everything from loading attributed graph datasets to training scalable models that run on graphs up to 111 million nodes using a single 32GB GPU, with mini-batch support and YAML-driven configs for quick experiments. Developers get reproducible benchmarks across 12 diverse datasets, complete with supervised accuracy metrics and unsupervised ones like modularity and conductance.

Why is it gaining traction?

It stands out by fixing common pain points in attributed graph clustering: over-reliance on tiny homophilous graphs, poor scalability beyond 100k nodes, and inconsistent evaluations that ignore structural quality. The plug-and-play design lets you swap encoders or cluster heads via config tweaks, while production-tested mini-batching and GPU-accelerated KMeans make it deployable for real-world tasks like fraud detection. Clean docs, PyPI install, and full benchmark reproducibility hook devs tired of scattered, non-scalable codebases.

Who should use this?

Graph ML engineers at companies like Ant Group tackling user profiling, anti-money laundering, or community detection on large attributed graphs with tabular or textual features. Researchers benchmarking methods on heterophilous or massive datasets, needing both label alignment (ACC, NMI) and graph structure metrics. PyTorch Geometric users wanting a drop-in library for attributed graph clustering without reinventing encoders or loaders.

Verdict

Grab it if you're in graph clustering—strong scalability, docs, and benchmarks outweigh the early maturity (19 stars, 1.0% credibility score). Low adoption means watch for edge cases, but it's a smart bet for PyTorch workflows.

(198 words)

Sign up to read the full AI review Sign Up Free

Similar repos coming soon.