Our Data
The Dataset
User Data
Comprehensive, up to date data on every active github user.
Each profile contains core information on the user, including name, email, and linkedin + their github repos, events, issues, and commits.
Fields
Core Fields
Core information on the person
blog
: the user’s blog linkgithub_username
: the user’s GitHub usernameinstagram_username
: the user’s Instagram usernamex_username
: the user’s X (Twitter) usernamelinkedin_username
: the user’s LinkedIn usernameavatar_url
: the user’s GitHub avatar image URLpublic_gists
: number of public GitHub gistsbio
: the user’s GitHub bioemail
: primary/best email for the userlast_name
: last namefirst_name
: first namemiddle_name
: middle namefull_name
: full namefollowers
: number of GitHub followersfollowing
: number of GitHub users followedpublic_repos
: number of public repositorieshireable
: whether the user is marked as hireable on GitHublocation
: location listed on GitHubcountry
: countrycompany
: company listed on GitHubcompany_name
: the company name, normalizedcompany_linkedin
: their company’s linkedin urlcompany_size
: the size of their company as a range ex. 11-50company_website
: their company’s websitecompany_industry
: their company’s industrycompany_founded
: year their company was foundedcompany_country
: country their company is based intitle
: job titlework_email
: work emailcreated_at
: GitHub account creation date (Flatfile Only)updated_at
: date of last GitHub profile update (Flatfile Only)all_names
: all names associated with the usersocial_accounts
: list of social media accounts (URLs)emails
: list of known email addresses
Repos
Github repos the user has forked or authored
repos.id
: unique ID of the repositoryrepos.name
: name of the repositoryrepos.full_name
: owner/repo formatrepos.description
: short description of the repositoryrepos.language
: primary programming languagerepos.created_at
: timestamp when the repo was createdrepos.updated_at
: timestamp when the repo was last updatedrepos.stargazers_count
: number of GitHub starsrepos.forks_count
: number of forksrepos.open_issues_count
: number of open issuesrepos.archived
: whether the repo is archivedrepos.topics
: a lirepos.default_branch
: the name of the repo’s default branchrepos.owner.login
: the repo owner’s github usernamerepos.owner.type
: the repo owner’s github account type (user or organization)repos.owner.avatar_url
: the repo owner’s avatar url
Commits
Github commits the user has made
commits.sha
: commit SHA hashcommits.url
: HTML URL to the commit on GitHubcommits.message
: commit messagecommits.author_name
: name of the commit authorcommits.author_email
: email of the commit authorcommits.date
: date/time of the commitcommits.repo_full_name
: full repository name (owner/repo)commits.repo_url
: HTML URL to the repository
Issues
Github issues the user created, commented on, was tagged in, or interacted with
issues.raised_by
: GitHub username of the person who created or raised the issueissues.date
: Date the issue was createdissues.updated_at
: Date the issue was last updatedissues.closed_at
: Date the issue was closed (if applicable)issues.link
: Direct URL to the issue on GitHubissues.title
: Title of the issueissues.message
: Combined title and body content of the issueissues.type
: Type of event (always “issues” in this context)issues.description
: Description/body of the issueissues.issue_number
: Numeric ID of the issue within the repositoryissues.status
: Current status of the issueissues.author_association
: Author’s relationship to the repositoryissues.repo_name
: Full name of the repository (owner/repo format)issues.labels
: List of labels applied to the issueissues.comment_count
: Total number of comments on the issueissues.state_reason
: Reason the issue was closed (e.g. completed, not_planned)issues.merged_at
: Timestamp of merge (if the issue was a pull request), or null
Events
Events, including issues, pull requests, comments, deletions, and stars, the user has made on github
events.date
: timestamp of the event formatted yyyy-mm-ddevents.type
: type of the eventevents.message
: description of the eventevents.repo
: object containing repo metadata, e.g. full_nameevents.link
: URL to view the event (e.g. commit, issue, PR)
Possible values for type
field:
commit
created_branch
created_repo
created_tag
delete
forked_repo
issue_comment
issues
pull_request
push
release
star
watch
Repo Data
Accurate, comprehensive, and real-time data on over a billion github repos.
Fields
Core Fields
full_name
: owner/repo formattopics
: list of topics the repo was tagged withdescription
: short description of the repositorylanguage
: primary programming languagecreated_at
: date when the repo was created yyyy-mm-ddupdated_at
: date when the repo was last updated yyyy-mm-ddstargazers_count
: number of GitHub stars