We are proud to announce the first Gephi-based system for exploring a complex network, CPAN-Explorer. This is a visualization project aiming at analyzing the relationships between the developers and the packages of the Perl language, known to be organized as the CPAN community (Comprehensive Perl Archive Network). Produced by RTGI Labs and our team, it was initially discussed in a talk at the FPW’09.
You can download original graph source files from each subproject page.
Available formats are: GEXF (Gephi graph format), GDF (Guess graph format), SVG, and PDF.
We generated two maps (authors and modules) using the CPANTS data. For the websites, we crawled a seed generated from the CPAN pages of the previous authors. Each of this graphs are generated using a force base algorithm.
All the map are available in PDF files, in creative common licence. The slides are in french, but we will explain the three maps here.
The first map is about the modules available on the CPAN. We selected a list of modules which are listed as dependancies by at least 10 others modules, and the modules who used them. This graph is composed of 7193 nodes (or modules) and 17510 edges. Some clusters are interesting:
- LWP and URI are really the center of the CPAN
- a lot of web modules (XML::*, TemplateToolkit, HTML::Parser, …)
- TK is isolated from the CPAN
- Moose, DBIx::Class and Catalyst are forming a group. This data are from march, we will try to do a newer version of this map this summer. This one will be really interesting as Catalyst have switched to Moose
The CPAN’s authors
This map is about the authors on the CPAN. There is about 700 authors and their connections. Each time an author use a module of another author, a link is created.
Modern Perl, constitued by Moose, Catalyst, DBIx::Class. Important authors are Steven, Sartak, perigin, jrockway, mstrout, nothingmuch, marcus ramberg
Slaven ReziÄ‡ and others TK developpers are on the border
We crawled the web using the seed generated using the CPAN’s authors pages.
- again, the “modern group”, on the top of the map, with Moose/Catalyst/DBIx::Class developpers
- some enterprises, like shadowcat and iinteractive in the middle of the “modern Perl”, Booking in the middle of the YAPC’s websites (they are a major sponsor of this events), 6apart, …
- perl.org is the reference for the Perl community (the site is oriented on their sides)
- cpan.org is the reference for the open source community
- github is in the center of the Perl community. It’s widely adopted by the Perl developpers. It offers all the “social media” features that are missing on the CPAN
We hope you like this visualisations, have fun analyzing them 🙂
Thanks Franck for the original post.