[go: up one dir, main page]

|
|
Log in / Subscribe / Register

Kernel development

Brief items

Kernel release status

The current 2.6 prepatch is 2.6.23-rc6, released by Linus on September 10. The number of fixes this time around is relatively small, partly because many of the developers were off at the kernel summit last week. The long-format changelog has the details.

The flow of patches into the mainline git repository continues; there will almost certainly need to be an -rc7 release before this kernel is done.

There have been no -mm releases over the last week.

For older kernels: 2.6.20.19 was released on September 8 with one security fix in the IPv6 code.

2.4.35.2 was released on September 8; it contains mostly compiler-related fixes. 2.4.36-pre1 was also released on the 8th; it contains a few fixes and a patch to optionally prevent processes from mapping the NULL address.

Comments (none posted)

Kernel development news

Quotes of the week

So I'm doing an inverted reverse polish bisection search to find out which patch preemptively fixes clockevents-fix-resume-logic.patch. Try doing that with git, suckers.
-- Andrew Morton

C++ is a horrible language. It's made more horrible by the fact that a lot of substandard programmers use it, to the point where it's much much easier to generate total and utter crap with it. Quite frankly, even if the choice of C were to do *nothing* but keep the C++ programmers out, that in itself would be a huge reason to use C.
-- Linus Torvalds

Comments (24 posted)

The 2007 Kernel Summit

By Jonathan Corbet
September 6, 2007
[Linus Torvalds] The 2007 version of the Linux Kernel Developers' Summit was held on September 5 and 6 in Cambridge, UK. Approximately 80 developers at this invitation-only event held discussions on a wide variety of topics covering all aspects of kernel development. As usual, LWN editor Jonathan Corbet was there. Reports from the sessions will appear below as they are written.

Day 1

  • The distributor panel. Kernel maintainers from four distributors attended a session meant to be a forum where they could tell the community how the process could be improved from their point of view. In the event, much of the information flowed in the other direction, with community developers expressing frustration with a number of distributor practices.

  • Mini-summit reports. Reports from mini-summits covering power management, filesystems and storage, virtual memory, and virtualization held in the months prior to the main kernel summit.

  • The greater kernel ecosystem and user-space APIs. A discussion of how the kernel presents interfaces to user space and the low-level software which helps with this task. Also covered here is the session on a proposal for a formal review process for new system calls.

  • Kernel quality. In the session he led on this topic, Andrew Morton was unable to say whether he thought our kernel releases were getting better or worse. But he had no doubt that we could be doing a better job than we are now.

  • Hardware support and the i386/x86_64 merger. This was a discussion of the state of drivers for various difficult chipsets; it included AMD's important announcement of the opening of its graphics processors. There was also a session on the question of whether the i386 and x86_64 architecture trees should be merged.

Day 2

The preparation of reports from the second day is being somewhat delayed by your editor's travel. They will show up here as they become available.

  • The customer panel. An interesting discussion of customer needs by representatives from Dreamworks, Credit Suisse, and the Linux Foundation.

  • Realtime and syslets. What is the status of the realtime patch set, and what's next for syslets?

  • Scalability. Issues for people trying to run Linux on very large and very small systems.

  • Memory management. Discussions on large page support, test cases for memory management patches, and letting applications help with memory pressure.

  • Containers. What remains to be done to have a complete containers implementation in the mainline kernel.

  • Developer relations and development process. How can the community bring in more developers and avoid driving away those who are here now? This question was addressed, along with a number of nuts-and-bolts issues relating to how the development process works.

  • Closing session. The final session of the 2007 kernel summit was about the kernel summit itself. Was this event what the attendees had hoped for, and how should things be done in the future?

The group picture

How could there be a kernel summit without a group picture? Here is (most of) the group in front of the Downing College dormitory where many of us stayed:

[Summit group photo]

This photo is available in the following forms:

By popular demand, we also have an annotated version of the full-resolution image with names assigned to as many faces as possible.

Thanks to Michael Kerrisk for operating your editor's camera, allowing him to be in the group picture for the first time.

Comments (none posted)

Exported symbols and the internal API

By Jonathan Corbet
September 11, 2007

Loadable kernel modules do not automatically have access to all symbols (functions and variables) defined in the kernel. In fact, access is limited to those symbols which have been explicitly exported for modular use. The idea behind this whitelist-like policy is that it helps the kernel developers to keep the module interface under control, limiting the ability of modules to dig into parts of the kernel where they are not welcome. The practice turns out to be a little more messy: current kernels have over 16,000 EXPORT_SYMBOL() declarations sprinkled around the source.

Unsurprisingly, there are developers who would like to reduce the number of exported symbols. It is often the case that, once a symbol can be shown to have no users among in-tree modules, it will be removed altogether. But there is not universal agreement on just how this process should be handled; as a result, we see occasional debates on how stable the modular API should actually be and what provisions should be made for out-of-tree code.

Adrian Bunk recently posted a patch to unexport sys_open() and sys_read(). These symbols (which implement the open() and read() system calls) have been on the hit-list for a long time. It is easy to make catastrophic mistakes when using them from kernel space, and there is almost no situation where opening and reading files from within the kernel is considered to be the right thing to do. But removing the exports has always proved hard, until now - there have always been stubborn in-tree users which have kept the export around.

The final holdout in 2.6.23 is the wavefront sound driver which uses sys_open() and sys_read() to obtain firmware to load into the device. The kernel has had a proper API for dealing with firmware loads for years, so no driver should be trying to read firmware directly from files itself. The current ALSA development tree contains a patch for the wavefront driver which makes it use the firmware API; once that patch is merged, there will be no more in-tree users of those symbols. Adrian, forever on the lookout for things to remove from the kernel, noticed this fact and promptly sent in a patch.

Andrew Morton's response went like this:

But I think it is better to give people some warning when we're planning on breaking out-of-tree things. I do occasionally receive reports of "hey, the X driver which I get from Y doesn't work any more". Often it's open-source stuff, too. I see no point in irritating our users more than we need to.

Andrew would like to have the symbols marked with EXPORT_UNUSED_SYMBOL() for one development cycle so that maintainers of out-of-tree code can get the resulting warning message and fix their code in response. It quickly became clear that he is in a minority among the developers on this issue. Adrian was particularly upset, complaining that other developers are allowed to make no-warning changes which break almost every module in existence while his patch, which affects very few modules, must go through a special process. He says:

Andrew, please define API rules, IOW rules for addition, removal and changing of exported code, that are valid for *everyone* or go to hell with your EXPORT_UNUSED_SYMBOL.

Christoph Hellwig also responded strongly, leading to this amusing (but not for the easily offended) exchange. Calmer voices made a few arguments against the warning period:

  • These symbols have been on the chopping block for a long time, and most out-of-tree module authors should have figured that out by now. It is worth noting, though, that the feature removal schedule in the kernel documentation says nothing about sys_open() and sys_read().

  • In this sort of situation warnings are almost entirely ineffective. Users tend not to see them at all, and they do not report them in any case. According to Alan Cox: "Short of using their sound card to scream 'Next release you are screwed' they won't notice (and if you the sound card trick they'll think they got rooted....)"

  • Keeping unused symbols around bloats the kernel and increases the load on developers who must remember to remove them in a future release.

Andrew does not appear willing to budge on the issue, though. He does not want to unnecessarily upset users who use out-of-tree modules:

Fact is, people use external modules. To get their machines working correctly, to get their work done, to do stuff they want done.

Many of these people are non-programmers. So when they download a new kernel and find that the module which they use doesn't work because of something which we've done, they get pissed off, and we lose a tester. This has happened many times.

To avoid this problem, he wants exported symbols targeted for removal to marked with EXPORT_UNUSED_SYMBOL() (or EXPORT_UNUSED_SYMBOL_GPL()) for one development cycle. The exports should be marked with a comment noting when the export should be removed altogether. Each release cycle would include a quick grep to find the symbols which are now due to be removed for real. He concludes:

Total cost of this effort: maybe ten developer minutes per release, and a few tens of additional bytes in the released vmlinux. I think that for a few additional testers and a few less-pissed-off users (nothing to do with developers), this cost is justified. That's all.

Elsewhere he has noted that, if a warning is sufficiently widespread, somebody, somewhere, will act on it. One gets the sense that he has not convinced a whole lot of developers that this position is right. But Andrew is in a position to enforce it and most of the others seem to think that, in the end, it's easier to just go along with what he wants in this case. The end result is the same, it just takes a little longer.

Comments (5 posted)

Who wrote 2.6.23

By Jonathan Corbet
September 11, 2007

While the 2.6.23 development cycle has not yet run its course, things are getting close enough to the end that it makes sense to start looking at the overall statistics for this release. As of this writing (shortly after 2.6.23-rc6 came out), just over 6,200 non-merge changesets had been added to the mainline kernel repository. These changesets came from 854 developers - a slightly smaller number than we saw for 2.6.22. Just over 350 of those developers contributed one single changeset.

All told, the patches added almost 430,000 lines, but also removed 406,000 lines, meaning that the kernel grew by just under 23,000 lines - a relatively small number. That is partially a result of kernel hatcheteer Adrian Bunk's work: he removed the old SpeedStep code, a number of Open Sound System drivers, Rise CPU support, and more - a total of almost 73,000 lines removed. Jeff Garzik hacked out over 41,000 lines of network driver code, and Jens Axboe got rid of over 25,000 lines of code, mostly in the form of ancient CDROM drivers.

Here is the list of the top contributors to 2.6.23, as counted by changesets merged and by lines of code changed:

Most active 2.6.23 developers
By changesets
Ingo Molnar1522.5%
Ralf Baechle1191.9%
Trond Myklebust1161.9%
Paul Mundt1111.8%
David S. Miller1071.7%
Tejun Heo1031.7%
Al Viro951.5%
Patrick McHardy931.5%
Adrian Bunk921.5%
FUJITA Tomonori911.5%
Avi Kivity721.2%
Andrew Morton711.1%
Greg Kroah-Hartman621.0%
Alan Cox580.9%
David Brownell560.9%
Jeff Garzik550.9%
Christoph Hellwig540.9%
Stephen Hemminger530.9%
H. Peter Anvin520.8%
Jesper Juhl520.8%
By changed lines
Adrian Bunk7325411.0%
Jeff Garzik432536.5%
Jens Axboe280044.2%
Hirokazu Takata203993.1%
Yoichi Yuasa183682.8%
James Smart156262.4%
Jeremy Fitzhardinge153982.3%
David S. Miller147522.2%
Matthew Wilcox147502.2%
Christoph Hellwig145502.2%
Rusty Russell94521.4%
Imre Deak89251.3%
Dan Williams85101.3%
Ralf Baechle83451.3%
Doug Thompson73101.1%
Yoshihiro Shimoda69811.1%
Marc St-Jean68881.0%
Luca Olivetti65401.0%
Cyrill Gorcunov63711.0%
Latchesar Ionkov53750.8%

Ingo Molnar comes out on top of the changesets column by virtue of getting the CFS scheduler merged - then fixing it. Over half of his patches were accepted after 2.6.23-rc1 came out. Ralf Baechle and Paul Mundt both contributed many changes to architecture-specific trees, Trond Myklebust did a lot of NFS work, and, while David Miller had a number of networking patches, the bulk of his changesets were in the architecture-specific (SPARC) trees. The figures on the "by changed lines" side are dominated by code removals (as described above); Jens Axboe also did a bunch of splice work and merged the "bsg" generic SCSI driver. Hirokazu Takata did a bunch of m32r architecture work. James Smart contributed a number of Fibre Channel changes and Jeremy Fitzhardinge merged the core Xen code.

Once again, we have put some effort into associating patches with the companies that supported this work, with the results shown below. These results should always be taken as approximations; we believe that they are essentially correct, but patches do not come with Paid-for-by: headers, so a certain amount of guessing is always required.

Most active 2.6.23 employers
By changesets
(Unknown)118019.0%
Red Hat74412.0%
(None)5599.0%
IBM5078.2%
Novell4216.8%
Intel1843.0%
Oracle1462.4%
Renesas Technology1342.2%
MIPS Technologies1191.9%
NetApp1161.9%
(Consultant)1031.7%
Google991.6%
NTT981.6%
Sony931.5%
Astaro931.5%
Linux Foundation821.3%
MontaVista811.3%
SGI771.2%
Qumranet721.2%
QLogic621.0%
By lines changed
(Unknown)11177716.9%
(None)9964915.0%
Red Hat8422412.7%
IBM394495.9%
Oracle362055.5%
Renesas Technology331525.0%
HP187182.8%
Tripeaks185672.8%
Novell179902.7%
Emulex159422.4%
XenSource154262.3%
Intel149622.3%
Sony119451.8%
Analog Devices103451.6%
rPath96781.5%
MIPS Technologies91711.4%
Solid Boot Ltd.89371.3%
MontaVista80651.2%
PMC-Sierra68881.0%
Astaro66871.0%

Red Hat retains its place at the top of the by-changesets list, though its percentage of changes has dropped a bit. By lines changed, developers known to be working on their own time (the "None" entry) beat out all corporate contributors. It is worth noting that much of lines-changed count for those developers is, in fact, lines removed.

Looking at who added Signed-off-by: lines to patches is interesting, especially if one looks at signoffs added by people other than the author of the patch. In this way, one gets an idea of who the gatekeepers are. There is a slight change to how this calculation was done this time around: if a patch carried signoffs from both Linus Torvalds and Andrew Morton, Linus's was not counted. As a result of how the process works, everything that goes through Andrew gets a signoff from Linus; not counting those signoffs gives a more accurate picture of how the review was actually done.

Developers with the most signoffs (total 5653)
Andrew Morton124721.6%
Linus Torvalds3976.9%
David S. Miller3816.6%
Greg Kroah-Hartman3295.7%
Jeff Garzik2875.0%
James Bottomley2644.6%
Paul Mackerras2233.9%
Mauro Carvalho Chehab1502.6%
Len Brown1282.2%
Ralf Baechle1222.1%
Roland Dreier1162.0%
Andi Kleen1132.0%
Russell King1011.8%
Jaroslav Kysela1001.7%
John W. Linville701.2%
Tony Luck651.1%
Takashi Iwai631.1%
Jens Axboe581.0%
Martin Schwidefsky551.0%
Ingo Molnar510.9%

One question which comes up sometimes is: how do these numbers look for specific parts of the kernel tree? Your editor duly hacked on his scripts to generate this sort of information. Here is a summary of the results - using the employer by-changesets numbers:

Employer changeset contributions by subsystem
/arch (1428 total)
(Unknown)22215.5%
IBM19813.9%
Red Hat1289.0%
(None)1087.6%
Renesas Technology1017.1%
MIPS Technologies896.2%
Sony553.9%
Novell463.2%
Intel463.2%
rPath422.9%
/block (103 total)
NTT2726.2%
Oracle1514.6%
(Unknown)109.7%
IBM87.8%
Red Hat65.8%
(None)54.9%
Miracle Linux43.9%
Computer Consultants32.9%
Novell32.9%
Sony32.9%
/Documentation (241 total)
(Unknown)6627.4%
Novell2711.2%
IBM197.9%
Oracle197.9%
(None)187.5%
Intel166.6%
Red Hat135.4%
(Consultant)62.5%
Freescale52.1%
NEC41.7%
/drivers (2762 total)
(Unknown)57220.7%
(None)35612.9%
Novell2378.6%
Red Hat2368.5%
IBM1916.9%
Intel1304.7%
(Consultant)682.5%
NTT652.4%
Qumranet632.3%
QLogic612.2%
/fs (622 total)
Red Hat10717.2%
Oracle8012.9%
NetApp7411.9%
(Unknown)7211.6%
Novell6310.1%
IBM569.0%
Univ. of Michigan CITI355.6%
SGI264.2%
(Academia)193.1%
SWsoft172.7%
/kernel (938 total)
Red Hat25927.6%
(Unknown)12913.8%
IBM11912.7%
Renesas Technology525.5%
(None)444.7%
Novell363.8%
MIPS Technologies313.3%
Fujitsu303.2%
Intel283.0%
Linutronix272.9%
/mm (261 total)
IBM3814.6%
(Unknown)3814.6%
Renesas Technology3312.6%
SGI2911.1%
Novell249.2%
Google197.3%
Red Hat135.0%
(None)103.8%
ARM72.7%
igel62.3%
/net (833 total)
(Unknown)17821.4%
Astaro9211.0%
Red Hat8710.4%
(None)718.5%
IBM536.4%
Linux Foundation485.8%
NetApp475.6%
Broadcom232.8%
Intel182.2%
HP172.0%

From these numbers, one might conclude that Red Hat developers are strong in the core kernel area, but they don't much like writing documentation. There is a lot of "hobbyist" participation in the driver subtree - not a particularly surprising result, since making a specific device work is a common itch for developers to scratch. Academics like to play with filesystems, as do, unsurprisingly, companies like Oracle and NetApp.

Beyond being approximate, all of the numbers shown above will change a bit before the final 2.6.23 release, which is probably at least three weeks away. The patches which will be merged in the coming weeks should all be fixed, though, so the changes will, with any luck at all, be small. All told, 2.6.23 shows an active kernel development community with contributions from a large number of developers - and quite a few companies which employ them. The kernel remains a vibrant and alive base on which to build our free systems.

(Thanks are due to Greg Kroah-Hartman for his contributions to the scripts used to generate these statistics).

Comments (6 posted)

Patches and updates

Kernel trees

Linus Torvalds Linux 2.6.23-rc6 ?
Willy Tarreau Linux 2.6.20.19 ?
Willy Tarreau Linux 2.4.36-pre1 ?
Willy Tarreau Linux 2.4.35.2 ?

Architecture-specific

Build system

Core kernel code

Development tools

Device drivers

Documentation

Filesystems and block I/O

Janitorial

Memory management

Security-related

Virtualization and containers

Miscellaneous

Page editor: Jonathan Corbet
Next page: Distributions>>


Copyright © 2007, Eklektix, Inc.
Comments and public postings are copyrighted by their creators.
Linux is a registered trademark of Linus Torvalds