Make, tmpfs, and [amc]time granularity

As you might have guessed (or not), I’ve been playing with the idea of improving my Debian rebuild setup by building on tmpfs: disk I/O takes a significant amount of time, which could easily be reduced by building (mostly) in memory.

Most packages build fine on tmpfs: only 8 packages fail specifically there. Apparently, most of the failures are caused by old versions of autoconf, which used /tmp to store temporary files; for example, when generating a Makefile, they would create it in /tmp, then move it to the current directory. The problem arises when /tmp is on an ext3 filesystem while the current directory is on tmpfs: ext3 stores timestamps with only one-second granularity, so the mtime of the file created there is truncated, and the truncated value survives the move to the current directory.

This can lead make to think that a given target still needs to be rebuilt. A nice example of that is the smuxi package, which just loops when building.

Demo:

# cat Makefile 
f2: f1
        echo "Need to rebuild f2!"

First case: f1 and f2 are stored in the current directory. Make agrees that f2 was created after f1:

# rm f1 f2 ; touch f1 ; touch f2 ; make
make: `f2' is up to date.

Second case: f2 is created on /tmp, then moved to the current directory. Make thinks that f2 was created before f1 (and is right, according to stat).

# rm f1 f2 ; touch f1 ; touch /tmp/f2 ; mv /tmp/f2 f2 ; make
echo "Need to rebuild f2!"
Need to rebuild f2!
# stat f1 f2     
  File: `f1'
[..]
Modify: 2009-03-08 18:21:08.900237000 +0100
  File: `f2'
[..]
Modify: 2009-03-08 18:21:08.000000000 +0100
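
A minimal workaround sketch (my addition, not something the affected packages actually do): re-touch the file after moving it off /tmp, so that its mtime is assigned by tmpfs again instead of keeping the truncated ext3 timestamp:

# rm f1 f2 ; touch f1 ; touch /tmp/f2 ; mv /tmp/f2 f2 ; touch f2 ; make
make: `f2' is up to date.

The final touch replaces the second-granularity timestamp with a fresh full-resolution one, so make is happy again.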

WYSIWYG tool to create simple websites?

Hi,

I’m looking for a tool to create simple websites (5-10 pages), only static content (no PHP). Think of the typical small business or sports club website: text + pictures about what they did/do, nothing really complex.

Requirements:

  • usable by a non-programmer who doesn’t want to know about gory technical details (HTML, CSS, etc.)
  • doesn’t make it too hard to create a website with a few pages (i.e. it should do a bit more than just allow creating one page)
  • built-in support for (S)FTP uploading of the website, usable by a non-programmer after a computer-literate person has set it up. (That’s not an absolute requirement, but it would still be nice to have. Also, what are the nice graphical tools for that, currently?)

I’m asking for two different people with different needs. So please don’t limit your replies to free software (although that would be nice): cheap proprietary applications might be adequate as well. Regarding platform, Linux, Mac OS X or Windows would all be suitable.

I don’t think that a CMS is the right tool here: the focus should be on the content (which won’t be modified often anyway) and its presentation, not on the ease of modification. For example, I’d like to be able to position pictures in a precise way. However, I might be interested in a CMS that is:

  • easy to set up (for a reasonably computer-literate person)
  • extremely easy to learn (a couple of hours at most, for a non-programmer)
  • easy to transform into something that looks good

In your comments, it would be great if you could include examples of what you did with those tools (yes, links are welcome!)

Thank you.

PhD!

This blog post is long overdue, but like Lenny, it was finally released. Yes, I defended my PhD thesis on December 4th. So I’m officially a Doctor from the Université Joseph Fourier in Grenoble.

The general context of my thesis is distributed systems, like peer-to-peer systems and applications running on HPC (High Performance Computing) clusters and grids. More specifically, I worked on methods and tools to study such systems, trying to answer the following questions: take a given application that will run on a distributed system. What can we learn or verify about its performance or its characteristics? Will it scale well, even with 1000 or 10000 nodes? Does this app perform better than that other app? Under which conditions? (That’s probably the hardest question, because you need to be able to run both apps under the same conditions if you want a fair comparison!)

My work involved studying basic building blocks that can be used to study distributed systems, like network emulators (Dummynet, Linux’s traffic control subsystem, NISTNet). I also designed P2PLab, which makes it possible to emulate network-centric peer-to-peer systems using a cluster, targeting experiments with thousands of emulated instances of the application. For example, it allowed us to run experiments with ~15000 emulated BitTorrent nodes using 160 real machines.

So what’s next? Well, my first-priority plan is to use my PhD to apply for an academic position (maître de conférences, i.e. assistant professor). If you know of an open position where I would be a good fit, don’t hesitate to tell me! Of course, this plan might very well fail (I will know around the end of June). In that case, I’ll probably be looking for a job, likely Free-Software-related. Again, if you hear of something for me, ping me!

Creating a large file without zeroing it: update

Given the large number of comments I got (26!), I feel obliged to post a summary of what was said.

First, the problem:
I want to create a large file (let’s say 10 GB) to use as swap space. This file can’t be a sparse file (a file with holes; see Wikipedia if you don’t know about sparse files).
Since I’m going to mkswap it, I don’t care about the data that is actually in that file after creating it. The stupid way (but the only solution on ext3) to create it is to fill it with zeroes, which is very inefficient.

Theodore Ts’o provided more information in a comment, which I’m copying here:

Yes, it will work on ext4. A convenient utility which makes this easy to use can be found at http://sandeen.fedorapeople.org/utilities/fallocate.c. It was written by Eric Sandeen, a former XFS developer who now works for Red Hat, and who has been a big help making sure ext4 will be ready for Fedora and Red Hat Enterprise Linux. (Well, I guess I shouldn’t call him a former XFS developer, since he still contributes patches to XFS now and then, but he’s spending rather more time on ext4 these days.)

One warning about the program: it calls the fallocate system call directly, and it doesn’t quite have the right architecture-specific magic for certain architectures which have various restrictions on how arguments need to be passed to system calls. In particular, IIRC, I believe there will be issues on the s390 and powerpc architectures. The real right answer is to get fallocate into glibc; folks with the pull to make glibc do the right thing, please talk to me.

Glibc does have posix_fallocate(), which implements the POSIX interface. posix_fallocate() is wired to use the fallocate system call, for sufficiently modern versions of glibc.

However, posix_fallocate() is problematic for some applications; the problem is that for filesystems that don’t support fallocate(), posix_fallocate() will simulate it by writing all zeros to the file. However, this is not necessarily the right thing to do; there are some applications that want fallocate() for speed reasons, but if the filesystem doesn’t support it, they want to receive the ENOSPC error message, so they can try some other fallback — which might or might not involve writing all zeros to the file.

The other shortcoming with posix_fallocate() is that it doesn’t support the FALLOC_FL_KEEP_SIZE flag. What this flag allows you to do is to allocate disk blocks to the file, but not to modify the i_size parameter. This allows you to allocate space for files such as log files and mail spool files so they will be contiguous on disk, but since i_size is not modified, programs that append to the file won’t get confused, and tail -f will continue to work. For example, if you know that your log files are normally approximately 10 megs a day, you can fallocate 10 megabytes, and then the log file will be contiguous on disk, and the space is guaranteed to be there (since it is already allocated). When you compress the log file at the end of the day, if the log file ended up being slightly smaller than 10 megs, the extra blocks will be discarded when you compress the file, or if you like, you can explicitly trim away the excess using ftruncate().
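
As an illustration of that flag (my addition, not part of the quoted comment): newer util-linux releases ship a fallocate(1) command that exposes it. Assuming a recent kernel, a filesystem with fallocate() support, and a hypothetical log file path, preallocating a day’s worth of log space would look like:

# touch /var/log/mylog
# fallocate -n -l 10M /var/log/mylog
# ls -lh /var/log/mylog
# du -h /var/log/mylog

ls keeps reporting a size of 0 (i_size is untouched, so tail -f and appending programs stay happy), while du shows that the 10 MB of blocks are really allocated.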

fallocate works fine: creating a 20 GB file is almost immediate. Syncing or unmounting the filesystem is also immediate, and reading the file returns only zeros. I’m not sure how it is implemented, but it looks nice :-). However, it still doesn’t solve my initial problem: mkswap works, but not swapon:

:/tmp# touch tmp
:/tmp# /root/fallocate -l 10g tmp
:/tmp# ls -lh tmp
-rw-r--r-- 1 root root 10G Mar  3 11:01 tmp
:/tmp# du tmp
10485764	tmp
:/tmp# mkswap tmp
Setting up swapspace version 1, size = 10737414 kB
no label, UUID=a316ce8e-cf33-412b-8dc0-e10d9f2ebdbb
:/tmp# strace swapon tmp
[...]
swapon("/tmp/tmp")                      = -1 EINVAL (Invalid argument)
write(2, "swapon: tmp: Invalid argument\n"..., 30swapon: tmp: Invalid argument
) = 30
exit_group(-1)

(swapon works fine if the file is created normally — without using fallocate()).

Any other ideas?
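
(For completeness, the one approach that is guaranteed to work everywhere is still the slow one from the original post, i.e. actually writing the zeros:

# dd if=/dev/zero of=$file bs=1M count=10240
# mkswap $file
# swapon $file

but the whole point of this exercise was to avoid that.)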

Creating a large file without zeroing it

Dear readers,

I’d like to use a large file (> 10 GB) as swap space.
The required steps are:

1. Create a file
2. mkswap $file
3. swapon $file

Now, how can I create the file? The obvious and fast solution is to create a file with holes:

dd if=/dev/zero of=foo bs=1M count=1 seek=10239

mkswap works, but swapon complains:

# swapon /tmp/foo
swapon: Skipping file /tmp/foo - it appears to have holes.
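
For the curious, a quick way to see those holes (using the foo created above) is to compare the apparent size with the blocks actually allocated:

# ls -lh foo
# du -h foo

ls reports the apparent size (10 GB), while du reports roughly the single megabyte that dd actually wrote; everything else is a hole.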

Of course, I could just dd if=/dev/zero of=foo bs=1M count=10240, but that takes too long for me.

So, question: is there a way to tell the system: create a file that is 10 GB big, but don’t fill it with zeros?

Re: GIT sucks

Scott has several posts about git (1, 2, 3), and the “I worked on this locally, now I want to push this to a remote place where others will be able to fetch it” use case.

I’ve been using the following snippet for that. YMMV.

REPO=reponame
ROOT=/wheremygitreposare/
SERV=remoteserver
# make a bare copy of the current repository, upload it, then point
# the local "master" branch at the uploaded copy:
git clone --bare . /tmp/${REPO}.git && \
rsync -avzP /tmp/${REPO}.git ${SERV}:$ROOT && \
git remote add origin ${SERV}:${ROOT}${REPO}.git && \
git config branch.master.remote origin && \
git config branch.master.merge refs/heads/master

(Please post comments if you can improve it!)
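
One possible simplification, as a sketch (it assumes ssh access to the server, and git push -u needs git >= 1.7.0): create the bare repository directly on the remote side instead of the clone + rsync round-trip:

ssh ${SERV} "git init --bare ${ROOT}${REPO}.git" && \
git remote add origin ${SERV}:${ROOT}${REPO}.git && \
git push -u origin master

Here push -u sets up the branch tracking, replacing the two git config calls.
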
But I totally agree that this common use case is not well addressed in git.

Ultimate Debian Database talk at FOSDEM

Of course, I was in Brussels this weekend, for FOSDEM. I gave a talk about Ultimate Debian Database (slides here) in the Debian devroom. The conclusion was “UDD is ready, go play with it!”, so you know what you should do now! FOSDEM in general was great and huge (see the video from Quim Gil if you still doubt that), as usual.

On a sadder note, the worst talk of the weekend was without any possible doubt Frans Pop’s release team bashing. Nobody is claiming that the release management of the lenny release cycle was perfect: there’s always room for improvement. But given the context and the constraints, I think that they did a very good job. Frans’ talk boiled down to: “The release team doesn’t know what they are doing; I would have done much better because I’m so qualified.” Giving such a talk at FOSDEM, in front of an audience with many people not involved in Debian development, is really insulting. To make things worse, he finished the talk late, not leaving any time for questions or discussion, so the audience might have been left with the impression that his opinions are representative of the opinions of DDs.

On Debian membership

The more I think about it, the more I’m convinced that the two approaches that have been proposed to change the way we deal with Debian membership (Joerg’s new statuses for non-developing contributors, and Lars’ counting of advocates) are both steps in the wrong direction.

I would really like becoming a Debian Developer to be simpler, and less technical. After all, what matters is that we trust new developers to make the right decisions (for example, not to touch something that they are not comfortable with). We currently verify that they know each and every technical bit about things that they may never encounter during their DD life, but that doesn’t say anything about their ability to work with others inside a community, or about their judgement when touching packages that they don’t maintain.

For example, we could have something inspired by the current process to become a Debian Maintainer: instead of going through 50+ questions, we could rely on advocacy from existing Debian Developers to judge new applicants. The advocacy wouldn’t simply be “+1” emails (as in Lars’ proposal), but would instead contain a detailed description of what the applicant did. A group of DDs (DAMs, basically) would then read the advocacy emails, and decide whether there’s sufficient proof that the applicant can be granted DD status.

That would allow a much faster process, removing the load from the Application Managers and NM Frontdesk members. And it probably wouldn’t reduce the quality of new DDs, because of the need to collect advocacy emails from several DDs.

Probably the best way to move forward with that would be to work on a document (a DEP?) that would list all the changes and explain them, and then ask all the developers to vote on it, like it was done for the Debian Maintainers GR. Is anyone interested in working on a first draft of such a document?

Date/time anecdotes

If you don’t know that leap years don’t always happen every 4 years, you really should read about the switch from the Julian to the Gregorian calendar. PostgreSQL’s documentation on the topic is also interesting, as is the output of ncal -p.
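
A classic way to see the switch for yourself (as observed in the British Empire; other countries switched on other dates):

# cal 9 1752
   September 1752
Su Mo Tu We Th Fr Sa
       1  2 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30

Eleven days simply do not exist in that month.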

On a related subject, the insertion of a leap second triggered a few interesting blog posts from Dave Jones (1, 2, 3), and also a Linux bug, reported on slashdot, and already fixed.

(Yes, that post was mostly an I-need-to-bookmark-this post.)

Membership status results

So, in the Membership Status GR, we sent a clear message that we wanted those decisions to be postponed. Now, we have a responsibility to prove that we were not just rejecting any change, but that our vote was really a vote against the way those decisions were announced, and/or against those particular decisions. I’m convinced that our management of membership needs to be improved.

I hope that in the next weeks/months, we will be able to discuss proposals about membership status. There’s Joerg’s, there’s Lars’, there’s Raphael’s, and there might be other ideas. Ideally, each of them would be discussed and improved separately (because we can’t be expected to agree on one proposal right from the start), and finally we would get to vote on a coherent set of proposals (with an unbiased ballot ;) to choose in which direction we want to change our management of membership. It would be great to use the DEP process or something similar for that.

It is also interesting to compare this whole story with our current issue: a secretary who ignored valid criticism about a ballot and went ahead with asking us to vote on it. In both cases, someone who is generally trusted and respected by the project made a mistake, and failed to acknowledge it, requiring strong actions from other DDs. I wonder how the secretary crisis will end…