This post is the latest in a series of posts about tracking processes on Linux which looks at the fine details of various APIs.
Here is some sample output from the Netlink Process Connector API on Linux. It contains an odd looking fork1:
EventType=FORK ParentPid=100 ParentTgid=99 ChildPid=200 ChildTgid=199
Take a moment to think about what is going on here and who gave birth to who.
At first glance, this fork looks impossible. The child doesn’t share a thread-group (tgid) with the parent, so it looks like a new process was created and not a new thread. However, if a new process was created then we should have a process with only one thread and therefore ChildPid should equal ChildTgid.
After a brief look at kernel sources, it turns out that the ParentPid and ParentTgid refer to the new process’ current parent which isn’t necessarily the process which called fork and birthed a child. In this case, we’re looking at a pretty standard report on a new thread’s creation by
clone(CLONE_THREAD). Threads have the same parent process as the thread which created them. Something like this happened:
- Process A (pid 99, tgid 99) created a new thread B
- Thread B (pid 100, tgid 99) created a new process C
- Process C (pid 199, tgid 199) created a new thread D
- D has pid 200, tgid 199. This is expected because it is a thread in C’s process group. Likewise, it’s parent is thread B in process A. Therefore ParentPid=100 and ParentTgid=99.
Tracking processes accurately is one of those things that seem easy at first but turn out to be full of odd edge cases. If you liked this post, you should also read my post on a strange bug where the netlink process connector “forgot” to send events or my comparison of all the methods of tracking running processes on linux.
- I use the term fork in this entire post to refer to all fork-like sycalls including clone [return]