One thing worth noting is that "redirection of streams" is a poor metaphor.
If I have two streams,
1 ---------- 1'
2 ---------- 2'
and I redirect the 1>2
1 ----
\
2 ---------- 2'
it seems to work okay.
But if you have three,
1 ---------- 1'
2 ---------- 2'
3 ---------- 3'
and try "redirecting 1 to 2 then 2 to 3", the natural reading would be
1 -
\
2 -----
\
3 ---------- 3'
But that's not what you get, because you're not "redirecting streams" - you're copying file descriptors. This is partially fixed by informing people to do the operations backwards, but there are corner cases where that breaks down, and "copying file descriptors in the order they appear on the line" never does because that's genuinely what's going on.
I guess what you're explaining is the process that happens as the kernel reserves fd's? And when you start your program there isn't necessarily any guarantee that a particular fd will be available, except for 0,1,2, right?
Copying file descriptors is genuinely what's going on, so understanding it that way always works.
Edited for clarity:
The way file descriptors work is that the kernel maintains, for each process, a table mapping from numerical file descriptors to structures describing the open file/socket/device associated with that descriptor.
When the shell is setting up to spawn a new process (or subshell, for that matter), it processes the redirections in order, left to right. For redirections of the form "A>&B" this is a call to dup2(B, A), which has the kernel copy the entry in the table at B over the entry at A.
Incidentally, you will note the direction of the symbol doesn't matter when cloning file descriptors: 5>&7 and 5<&7 mean the same thing - you're just copying a file descriptor. It does matter when opening a file, as the file will be opened for reading or writing before before being dup2'ed over the requested descriptor. Having said that, of course use the correct symbol - it's good documentation.
Thanks for the this. In my opinion, there's too little written about passing fd's (use of dup2). It seems like a technique that is more useful than just as part of shell redirection. Safer than popen()?
I'm pretty sure popen is just a wrapper around this stuff. It's basically,
pipe: get two fds connected by a pipe
fork: create a child process
dup2: move the read fd of the pipe to 0 (stdin) in the new process
exec: run the program in the child process
There's some cleanup but that's the gist - popen isn't a syscall (or reasonably close to one), so must necessarily rely on other stuff to get its work done where it actually interfaces with the kernel.
That makes perfect sense. It's the fork+exec that makes it dangerous (e.g. in the CGI context). This is something I've been wondering about and you have just provided a jolt of clarity. Many thanks, again.
One thing missing is <<- heredocs, which strip (some) leading whitespace. It lets you keep up your indentation in a shell script without passing a bunch of whitespace to the commands.
to be particularly useful in the past for pushing generated data into something that doesn't accept - as an alias for STDIN. I'm not sure if it's a different way of expressing one of the listed items though; iirc it creates a temporary fifo
> Does anyone know the rationale behind ordering of redirects?
I can't give you an exact rationale, but my old shell scripting books from the 90's use the cmd > file 2>&1 notation. My personal belief is that it was for situations when you would do something like this:
The network pseudo filehandles (/dev/tcp/<host>/<port> and /dev/udp/<host>/<port>) are explicitly disabled in certain distros (notably Debian). Ubuntu appears to support it.
Though not limited to just shell redirections, one of the true masters of bash (and zsh) is Larry Peek whose written/given multiple books, articles, and presentations on the topic.
the "cmd <<EOL ..\n..\n EOL" part has been mystical to me for ages, especially as I didn't know what to google to find out. I recently learned it was called a "Here Document". http://en.wikipedia.org/wiki/Here_document
For extra credit, try using a "Here Document" as input for 'echo' and 'cat'. Explain the results.
Is there a particular reason you'd go for the PNG version over the PDF version? It's a more suitable format for these "cheat sheet" type pages, since it'll use the platform text rendering, and also keep the vector information (meaning it'll print better if you want to hang it up next to your desk).
If I have two streams,
and I redirect the 1>2 it seems to work okay.But if you have three,
and try "redirecting 1 to 2 then 2 to 3", the natural reading would be But that's not what you get, because you're not "redirecting streams" - you're copying file descriptors. This is partially fixed by informing people to do the operations backwards, but there are corner cases where that breaks down, and "copying file descriptors in the order they appear on the line" never does because that's genuinely what's going on. foo 1>2 2>3This is "run foo copying fd 2 to 1 and 3 to 2"
which gives