Home / News / Sourcegraph plans to index the entire open source web

Sourcegraph plans to index the entire open source web

The Turn out to be Generation Summits get started October 13th with Low-Code/No Code: Enabling Undertaking Agility. Sign up now!

Let the OSS Undertaking publication information your open supply adventure! Enroll right here.

Sourcegraph is increasing its common code seek platform to the cloud and within the procedure indexing tens of millions of public repositories from GitHub and GitLab so any person can seek them. The release comes sizzling at the heels of a $125 million collection D investment spherical that valued the corporate at a hefty $2.6 billion.

“We’re launching Sourcegraph.com as a full-fledged product for looking out the open supply universe,” Sourcegraph cofounder and CTO Beyang Liu informed VentureBeat.

Large code drawback

Based in 2013, Sourcegraph got down to “take on the massive code drawback” with a platform that addresses the rising quantity and number of supply code maximum companies must maintain throughout their tasks. With each corporate now necessarily a instrument corporate, all of them must maintain code (to various levels). However as those codebases develop and extra repositories and developer equipment are thrown into the enormous coding cauldron, it turns into trickier to control the whole lot and more difficult for builders to satisfy dash cut-off dates.

To deal with this problem, Sourcegraph combines the more than a few strands that make up a contemporary developer operations (DevOps) stack, spanning repositories, programming languages, record codecs, editors, and extra. Thru Sourcegraph, builders can to find and sort things extra temporarily, determine how you can use a selected serve as, determine what affect converting a work of code can have on dependencies, automate large-scale refactors, and extra.

Sourcegraph plans to index the entire open source web - Sourcegraph plans to index the entire open source web

Above: Sourcegraph: Massive-scale refactor with automatic “batch alternate”

To this point, Sourcegraph consumers equivalent to Amazon, Cloudflare, Uber, and PayPal have needed to run self-hosted Sourcegraph circumstances. However as a part of its project to index all of the open supply internet and make it searchable, the San Francisco-based corporate may be ushering the trade facet of its operations into the hosted cloud technology.

Whilst this will likely indubitably enchantment to startups and person coders, for the reason that the cloud makes it more uncomplicated to collaborate and seek for repositories, it is going to additionally open Sourcegraph’s target audience to a broader vary of endeavor consumers preferring a cloud product.

The corporate hasn’t given a selected date for this shift, but it surely mentioned as of late’s announcement units the wheels in movement for a “larger release” q4 that can carry Sourcegraph “to a brand new batch of businesses.”


Sourcegraph’s new portal is a seek engine for code that permits any person to search out and pore over tens of millions of open supply tasks and private personal code without cost — the facility so as to add personal repositories to Sourcegraph’s cloud wasn’t to be had to the general public prior to now. Sourcegraph will even price corporations to add their personal repositories so interior builders can seek them from their browser.

“This can be a important transfer for us as an organization as it indicators our shift to a SaaS trade fashion,” Liu mentioned.

In the past, Sourcegraph.com used to be “principally an excellent giant demo of Sourcegraph Undertaking,” in keeping with Liu, that means there used to be no means for customers so as to add their very own public or personal repositories. “The hunt index used to be giant by means of interior codebase requirements however small in comparison to the entire quantity of fascinating open supply [projects],” he mentioned.

Although the general public code seek interface has been are living for a while already as an explanation of idea, for as of late’s professional release Sourcegraph has listed the highest 1 million repositories on GitHub and kind of 12,000 from GitLab. By way of the tip of the 12 months, it plans to push the entire determine to greater than five million — each GitHub and GitLab repository with a couple of megastar.

“We’re prioritizing by means of high quality as a result of while you’re looking out over code, you care about discovering the most efficient serve as or easiest utilization instance, now not just a few random code snippet that would possibly include insects,” Liu defined.

Sourcegraph will even come with outstanding open supply tasks that aren’t on both GitHub or GitLab, and builders will be capable of manually upload any repository themselves, irrespective of its megastar score.

“Google for code”

Whilst code is already searchable via its respective code hosts, Liu likens the established order to that of internet seek within the days of AltaVista.

“What we’re construction is extra like a Google for code,” Liu defined. “Sourcegraph is clearly slightly other from Google Seek, as a result of code is an excessively other type of knowledge. Nevertheless it’s identical in that we’re fixing the hunt drawback as a firstclass citizen — we’ve invested in deep generation that allows us to construct a a lot better consumer enjoy. And as a end result, builders who use Sourcegraph to find themselves looking out over code an order of magnitude greater than once they had been simply the usage of their code host’s seek capability.”

Pooling GitHub and GitLab will most likely duvet the lion’s proportion of “profitable” open supply tasks and cause them to searchable via a unmarried interface, saving builders from having to discuss with other channels and interfaces to search out what they’re on the lookout for.

“We see this always with our consumers that experience more than one code hosts — one of the most giant attracts of Sourcegraph is it’s intuitive and the whole lot is obtainable in a single position,” Liu defined. “Now we will be able to have the entire open supply discoverable in a single position too.”


VentureBeat’s project is to be a virtual the town sq. for technical decision-makers to achieve wisdom about transformative generation and transact.

Our web site delivers crucial data on knowledge applied sciences and techniques to lead you as you lead your organizations. We invite you to transform a member of our group, to get right of entry to:

  • up-to-date data at the topics of hobby to you
  • our newsletters
  • gated thought-leader content material and discounted get right of entry to to our prized occasions, equivalent to Turn out to be 2021: Be informed Extra
  • networking options, and extra

Turn into a member


Check Also

Relyance emerges from stealth to spot risky code 310x165 - Relyance emerges from stealth to spot risky code

Relyance emerges from stealth to spot risky code

The Turn into Era Summits get started October 13th with Low-Code/No Code: Enabling Undertaking Agility. …