OpenDigger
OpenDigger is an open source analysis report project for all open source data initiated by X-lab, this project aims to combine the wisdom of global developers to jointly analyze and insight into open source related data to help everyone better understand and participate in open source.
Metrics or Indices Usage
All implemented metrics are open for anyone to use, you can find the data with following URLs, The root URL of OpenDigger static data is https://oss.x-lab.info/open-digger/github/
right now since we only have GitHub data for now, just replace the org/repo
or owner
to get your data.
Feel free to use the data to construct your own data application and you can refer OpenDigger as your data source and welcome to use the following badge in your project to show the data source.
For repos
For users
Type | Name | From | Example | Code | CodePen |
---|---|---|---|---|---|
Index | OpenRank | X-lab | openrank.json | Link | Demo |
Activity | X-lab | activity.json | Link | Demo | |
Network | Developer network | X-lab | developer_network.json | Link | Demo |
Repo network | X-lab | repo_network.json | Link | Demo |
Users
For above data, there are some users of OpenDigger right now.
Applications
- OpenLeaderboard: A leaderboard of GitHub world with labeled data.
- Hypercrx: A browser plugin helps to look into GitHub users and repos.
- Mulan Dashboard: A dashboard for Mulan community.
- Hacking Force China: A ranking list of Chinese developers on GitHub cooperate with SegmentFault.
Open source reports
- China Open Source Report 2021: Most comprehensive open source report in China by kaiyuanshe.
- China Open Source Blue Paper 2022: A blue paper of Chinese open source development by COPU.
- Big Data Open Source Heat Report: A heat report of open source projects in big data area.
Events
OpenDigger community also open to inter-community cooperation events, like contests or hackathons.
- PaddlePaddle Hackathon 3rd, the hackathon final reports are here.
Data
GitHub Event Log
We use GHArchive as our data source for GitHub event logs and the data service is provided by clickhouse cluster cloud service. For data details, please check the data docs.
Labeled Data
We are collecting labeled data for more deeper analysis. You can view the corresponding data in the labeled_data
folder. For more details, please check labeled_data docs.
Sample Data Usage
OpenDigger provides ClickHouse sample data and Jupyter notebook image to run OpenDigger in local environment, please refer to sample data doc.
Communication
Welcome to join the WeChat group by scanning the QRCode and I will invite you into our WeChat group.
License
We use Apache-2.0 license for code part, please make sure abide by the licenses when using the project.