Sure, you may measure software program developer productiveness
In contrast with different crucial enterprise features reminiscent of gross sales or buyer operations, software program growth is perennially undermeasured. The long-held perception by many in tech is that it’s not potential to do it appropriately—and that, in any case, solely educated engineers are educated sufficient to evaluate the efficiency of their friends. But that establishment is now not sustainable. Now that the majority corporations have gotten (to 1 diploma or one other) software program corporations, no matter {industry}, leaders must know they’re deploying their most dear expertise as efficiently as potential.
There isn’t any denying that measuring developer productiveness is troublesome. Different features will be measured fairly effectively, some even with only a single metric; whereas in software program growth, the hyperlink between inputs and outputs is significantly much less clear. Software program growth can also be extremely collaborative, advanced, and inventive work and requires totally different metrics for various ranges (reminiscent of programs, groups, and people). What’s extra, even when there’s real dedication to trace productiveness correctly, conventional metrics can require programs and software program which might be set as much as enable extra nuanced and complete measurement. For some normal metrics, whole tech stacks and growth pipelines should be reconfigured to allow monitoring, and putting in the required devices and instruments to yield significant insights can require important, long-term funding. Moreover, the panorama of software program growth is altering shortly as generative AI instruments reminiscent of Copilot X and ChatGPT have the potential to allow builders to finish duties as much as two instances sooner.
To assist overcome these challenges and make this crucial process extra possible, we developed an method to measuring software program developer productiveness that’s simpler to deploy with surveys or present information (reminiscent of in backlog administration instruments). In so doing, we constructed on the inspiration of present productiveness metrics that {industry} leaders have developed through the years, with an eye fixed towards revealing alternatives for efficiency enhancements.
This new method has been applied at almost 20 tech, finance, and pharmaceutical corporations, and the preliminary outcomes are promising. They embody the next enhancements:
- 20 to 30 % discount in customer-reported product defects
- 20 % enchancment in worker expertise scores
- 60-percentage-point enchancment in buyer satisfaction rankings
Leveraging productiveness insights
With entry to richer productiveness information and insights, leaders can start to reply urgent questions in regards to the software program engineering expertise they fought so arduous to draw and retain, reminiscent of the next:
- What are the impediments to the engineers working at their greatest degree?
- How a lot does tradition and group have an effect on their skill to provide their greatest work?
- How do we all know if we’re utilizing their time on actions that actually drive worth?
- How can we all know if we’ve all of the software program engineering expertise we’d like?
Understanding the foundations
To make use of a sufficiently nuanced system of measuring developer productiveness, it’s important to know the three kinds of metrics that should be tracked: these on the system degree, the workforce degree, and the person degree. In contrast to a perform reminiscent of gross sales, the place a system-level metric of {dollars} earned or offers closed could possibly be used to measure the work of each groups and people, software program growth is collaborative in a particular manner that requires totally different lenses. As an example, whereas deployment frequency is a wonderfully good metric to evaluate programs or groups, it relies on all workforce members doing their respective duties and is, due to this fact, not a helpful strategy to observe particular person efficiency.
One other crucial dimension to acknowledge is what the varied metrics do and don’t let you know. For instance, measuring deployment frequency or lead time for modifications can provide you a transparent view of sure outcomes, however not of whether or not an engineering group is optimized. And whereas metrics reminiscent of story factors accomplished or interruptions may also help decide optimization, they require extra investigation to determine enhancements that is likely to be useful.
In constructing our set of metrics, we appeared to develop on the 2 units of metrics already developed by the software program {industry}. The primary is DORA metrics, named for Google’s DevOps analysis and evaluation workforce. These are the closest the tech sector has to a typical, and they’re nice at measuring outcomes. When a DORA metric returns a subpar end result, it’s a sign to analyze what has gone unsuitable, which may usually contain protracted sleuthing. For instance, if a metric reminiscent of deployment frequency will increase or decreases, there will be a number of causes. Figuring out what they’re and the best way to resolve them is commonly not easy.
The second set of industry-developed measurements is SPACE metrics (satisfaction and well-being, efficiency, exercise, communication and collaboration, and effectivity and move), which GitHub and Microsoft Analysis developed to enhance DORA metrics. By adopting a person lens, significantly round developer well-being, SPACE metrics are nice at clarifying whether or not an engineering group is optimized. For instance, a rise in interruptions that builders expertise signifies a necessity for optimization.
On high of those already highly effective metrics, our method seeks to determine what will be achieved to enhance how merchandise are delivered and what these enhancements are price, with out the necessity for heavy instrumentation. Complementing DORA and SPACE metrics with opportunity-focused metrics can create an end-to-end view of software program developer productiveness (Exhibit 1).
These opportunity-focused productiveness metrics use just a few totally different lenses to generate a nuanced view of the advanced vary of actions concerned with software program product growth.
Interior/outer loop time spent. To determine particular areas for enchancment, it’s useful to think about the actions concerned in software program growth as being organized in two loops (Exhibit 2). An interior loop includes actions straight associated to creating the product: coding, constructing, and unit testing. An outer loop includes different duties builders should do to push their code to manufacturing: integration, integration testing, releasing, and deployment. From each a productiveness and personal-experience standpoint, maximizing the period of time builders spend within the interior loop is fascinating: constructing merchandise straight generates worth and is what most builders are excited to do. Outer-loop actions are seen by most builders as obligatory however typically unsatisfying chores. Placing time into higher tooling and automation for the outer loop permits builders to spend extra time on inner-loop actions.
High tech corporations goal for builders to spend as much as 70 % of their time doing inner-loop actions. For instance, one firm that had beforehand accomplished a profitable agile transformation realized that its builders, as an alternative of coding, had been spending an excessive amount of time on low-value-added duties reminiscent of provisioning infrastructure, operating guide unit exams, and managing check information. Armed with that perception, it launched a collection of latest instruments and automation initiatives to assist with these duties throughout the software program growth life cycle.

Developer Velocity Index benchmark. The Developer Velocity Index (DVI) is a survey that measures an enterprise’s expertise, working practices, and organizational enablement and benchmarks them in opposition to friends. This comparability helps unearth particular areas of alternative, whether or not in backlog administration, testing, or safety and compliance. For instance, one firm, well-known for its technological prowess and all-star builders, sought to outline normal working practices extra thoughtfully for cross-team collaboration after discovering a excessive quantity of dissatisfaction, rework, and inefficiency reported by builders.
Contribution evaluation. Assessing contributions by people to a workforce’s backlog (beginning with information from backlog administration instruments reminiscent of Jira, and normalizing information utilizing a proprietary algorithm to account for nuances) may also help floor tendencies that inhibit the optimization of that workforce’s capability. This type of perception can allow workforce leaders to handle clear expectations for output and enhance efficiency in consequence. Moreover, it may assist determine alternatives for particular person upskilling or coaching and rethinking position distribution inside a workforce (for example, if a top quality assurance tester has sufficient work to do). For instance, one firm discovered that its most gifted builders had been spending extreme time on noncoding actions reminiscent of design classes or managing interdependencies throughout groups. In response, the corporate modified its working mannequin and clarified roles and tasks to allow these highest-value builders to do what they do greatest: code. One other firm, after discovering comparatively low contribution from builders new to the group, reexamined their onboarding and private mentorship program.
Expertise functionality rating. Based mostly on {industry} normal functionality maps, this rating is a abstract of the person data, expertise, and talents of a particular group. Ideally, organizations ought to aspire to a “diamond” distribution of proficiency, with the vast majority of builders within the center vary of competency. This may floor teaching and upskilling alternatives, and in excessive instances name for a rethinking of expertise technique. For instance, one firm discovered the next focus of their builders within the “novice” functionality than was ultimate. They deployed customized studying journeys based mostly on particular gaps and had been capable of transfer 30 % of their builders to the subsequent degree of experience inside six months.
Avoiding metrics missteps
As worthwhile as it may be, developer productiveness information will be damaging to organizations if used incorrectly, so it’s necessary to keep away from sure pitfalls. In our work we see two essential kinds of missteps happen: misuse of metrics and failing to maneuver previous outdated mindsets.
Misuse is commonest when corporations attempt to make use of overly easy measurements, reminiscent of traces of code produced, or variety of code commits (when builders submit their code to a model management system). Not solely do such easy metrics fail to generate actually helpful insights, they will have unintended penalties, reminiscent of leaders making inappropriate trade-offs. For instance, optimizing for lead time or deployment frequency can enable high quality to undergo. Specializing in a single metric or too easy a group of metrics also can simply incentivize poor practices; within the case of measuring commits, for example, builders could submit smaller modifications extra regularly as they search to recreation the system.
To really profit from measuring productiveness, leaders and builders alike want to maneuver previous the outdated notion that leaders “can’t” perceive the intricacies of software program engineering, or that engineering is simply too advanced to measure. The significance of engineering expertise to an organization’s success, and the fierce competitors for developer expertise lately, underscores the necessity to acknowledge that software program growth, like so many different issues, requires measurement to be improved. Additional, attracting and retaining high software program growth expertise relies upon largely on offering a office and instruments that enable engineers to do their greatest work and encourages their creativity. Measuring productiveness at a system degree allows employers to see hidden friction factors that impede that work and creativity.
Getting began
The mechanics of constructing a developer productiveness initiative can appear daunting, however there isn’t any time like the current to start to put the groundwork. The components driving the necessity to elevate the dialog about software program developer productiveness to C-level leaders outweigh the impediments to doing so.
The rise in distant work and its reputation amongst builders is one overriding issue. Builders have lengthy labored in agile groups, collaborating in the identical bodily area, and a few expertise leaders imagine that form of in-person teamwork is crucial to the job. Nonetheless, the digital instruments which might be so central to their work made it simple to change to distant work throughout the pandemic lockdowns, and as in most sectors, this shift is difficult to undo. As distant and hybrid working more and more turns into the norm, organizations might want to depend on broad, goal measurements to take care of confidence in these new working preparations and guarantee they’re steadily bettering the perform that might simply decide their future success or failure. The truth that the markets are actually placing better emphasis on environment friendly progress and ROI solely makes it extra necessary than ever to know the way they will optimize the efficiency of their extremely valued engineering expertise.
One other key driver of this want for better visibility is the fast advances in AI-enabled tooling, particularly large-language fashions reminiscent of generative AI. These are already quickly altering the way in which work is completed, which signifies that measuring software program builders’ productiveness is just a primary step to understanding how these worthwhile sources are deployed.
However as crucial as developer productiveness is turning into, corporations shouldn’t really feel they need to embark on an enormous, dramatic overhaul virtually in a single day. As an alternative, they will start the method with a variety of key steps:
Study the fundamentals. All C-suite leaders who are usually not engineers or who’ve been in administration for a very long time will want a primer on the software program growth course of and the way it’s evolving.
Assess your programs. As a result of developer productiveness has not sometimes been measured on the degree wanted to determine enchancment alternatives, most corporations’ tech stacks would require probably intensive reconfiguration. For instance, to measure check protection (the extent to which areas of code have been adequately examined), a growth workforce must equip their codebase with a device that may observe code executed throughout a check run.
Construct a plan. As with most analytics initiatives, getting misplaced in mountains of information is a danger. It’s necessary to begin with one space that you recognize will lead to a transparent path to enchancment, reminiscent of figuring out friction factors and bottlenecks. Be specific in regards to the scope of such a plan, as even the perfect approaches, irrespective of how complete, won’t be a silver bullet.
Keep in mind that measuring productiveness is contextual. The purpose is to have a look at a whole system and perceive the way it can work higher by bettering the event surroundings on the system, workforce, or particular person degree.
Regardless of the particular method, measuring productiveness ought to ideally create transparency and insights into key enchancment areas. Solely then can organizations construct particular initiatives to drive affect for each developer productiveness and expertise—affect that may profit each these people and the corporate as a complete.