FORK()函數(shù)的理解
對(duì)于剛剛接觸Unix/Linux操作系統(tǒng),在Linux下編寫多進(jìn)程的人來說,fork是最難理解的概念之一:它執(zhí)行一次卻返回兩個(gè)值。
首先我們來看下fork函數(shù)的原型:
#i nclude
#i nclude
pid_t fork(void);
返回值:
負(fù)數(shù):如果出錯(cuò),則fork()返回-1,此時(shí)沒有創(chuàng)建新的進(jìn)程。最初的進(jìn)程仍然運(yùn)行。
零:在子進(jìn)程中,fork()返回0
正數(shù):在負(fù)進(jìn)程中,fork()返回正的子進(jìn)程的PID
其次我們來看下如何利用fork創(chuàng)建子進(jìn)程。
創(chuàng)建子進(jìn)程的樣板代碼如下所示:
pid_t child;
if((child = fork())<0)
/*錯(cuò)誤處理*/
else if(child == 0)
/*這是新進(jìn)程*/
else
/*這是最初的父進(jìn)程*/
fock函數(shù)調(diào)用一次卻返回兩次;向父進(jìn)程返回子進(jìn)程的ID,向子進(jìn)程中返回0,
這是因?yàn)楦高M(jìn)程可能存在很多過子進(jìn)程,所以必須通過這個(gè)返回的子進(jìn)程ID來跟蹤子進(jìn)程,
而子進(jìn)程只有一個(gè)父進(jìn)程,他的ID可以通過getppid取得。
下面我們來對(duì)比一下兩個(gè)例子:
第一個(gè):
#include
#include
int main()
{
pid_t pid;
int count=0;
pid = fork();
printf( "This is first time, pid = %dn", pid );
printf( "This is secONd time, pid = %dn", pid );
count++;
printf( "count = %dn", count );
if ( pid>0 )
{
printf( "This is the parent process,the child has the pid:%dn", pid );
}
else if ( !pid )
{
printf( "This is the child Process.n")
}
else
{
printf( "fork failed.n" );
}
printf( "This is third time, pid = %dn", pid );
printf( "This is fouth time, pid = %dn", pid );
return 0;
}
運(yùn)行結(jié)果如下:
問題:
這個(gè)結(jié)果很奇怪了,為什么printf的語句執(zhí)行兩次,而那句“count++;”的語句卻只執(zhí)行了一次
接著看:
#include
#include
int main(void)
{
pid_t pid;
int count=0;
pid = fork();
printf( "Now, the pid returned by calling fork() is %dn", pid );
if ( pid>0 )
{
printf( "This is the parent procESS,the child has the pid:%dn", pid );
printf( "In the parent process,count = %dn", count );
}
else if ( !pid )
{
printf( "This is the child process.n");
printf( "Do your own things here.n" );
count ++;
printf( "In the child process, count = %dn", count );
}
else
{
printf( "fork failed.n" );
}
return 0;
}
運(yùn)行結(jié)果如下:
現(xiàn)在來解釋上面提出的問題。
看這個(gè)程序的時(shí)候,頭腦中必須首先了解一個(gè)概念:在語句pid=fork()之前,只有一個(gè)進(jìn)程在執(zhí)行這段代碼,但在這條語句之后,就變成兩個(gè)進(jìn)程在執(zhí)行了,這兩個(gè)進(jìn)程的代碼部分完全相同,將要執(zhí)行的下一條語句都是if ( pid>0 )……。
兩個(gè)進(jìn)程中,原先就存在的那個(gè)被稱作“父進(jìn)程”,新出現(xiàn)的那個(gè)被稱作“子進(jìn)程”。父子進(jìn)程的區(qū)別除了進(jìn)程標(biāo)志符(process ID)不同外,變量pid的值也不相同,pid存放的是fork的返回值。fork調(diào)用的一個(gè)奇妙之處就是它僅僅被調(diào)用一次,卻能夠返回兩次,它可能有三種不同的返回值:
1. 在父進(jìn)程中,fork返回新創(chuàng)建子進(jìn)程的進(jìn)程ID;
2.在子進(jìn)程中,fork返回0;
3.如果出現(xiàn)錯(cuò)誤,fork返回一個(gè)負(fù)值;
fork出錯(cuò)可能有兩種原因:(1)當(dāng)前的進(jìn)程數(shù)已經(jīng)達(dá)到了系統(tǒng)規(guī)定的上限,這時(shí)errno的值被設(shè)置為EAGAIN。(2)系統(tǒng)內(nèi)存不足,這時(shí)errno的值被設(shè)置為ENOMEM。
接下來我們來看看APUE2中對(duì)fork的說明:
The new process created by fork is called the child process. This function is called once but returns twice. The only difference in the returns is that the return value in the child is 0, whereas the return value in the parent is the process ID of the new child. The reason the child‘s process ID is returned to the parent is that a process can have more than one child, and there is no function that allows a process to o^ain the process IDs of its children. The reason fork returns 0 to the child is that a process can have only a single parent, and the child can always call getppid to o^ain the process ID of its parent. (Process ID 0 is reserved for use by the kernel, so it‘s not possible for 0 to be the process ID of a child.)
被fork創(chuàng)建的新進(jìn)程叫做自進(jìn)程。fork函數(shù)被調(diào)用一次,卻兩次返回。返回值唯一的區(qū)別是在子進(jìn)程中返回0,而在父進(jìn)程中返回子進(jìn)程的pid。在父進(jìn)程中要返回子進(jìn)程的pid的原因是父進(jìn)程可能有不止一個(gè)子進(jìn)程,而一個(gè)進(jìn)程又沒有任何函數(shù)可以得到他的子進(jìn)程的pid。
Both the child and the parent continue executing with the instruction that follows the call to fork. The child is a copy of the parent. For example, the child gets a copy of the parent‘s data space, heap, and stack. Note that this is a copy for the child; the parent and the child do not share these portions of memory. The parent and the child share the text segment (Section 7.6).[!--empirenews.page--]
子進(jìn)程和父進(jìn)程都執(zhí)行在fork函數(shù)調(diào)用之后的代碼,子進(jìn)程是父進(jìn)程的一個(gè)拷貝。例如,父進(jìn)程的數(shù)據(jù)空間、堆??臻g都會(huì)給子進(jìn)程一個(gè)拷貝,而不是共享這些內(nèi)存。
Current implementations don‘t perform. a complete copy of the parent‘s data, stack, and heap, since a fork is often followed by an exec. Instead, a technique called copy-on-write (COW) is used. These regions are shared by the parent and the child and have their protection changed by the kernel to read-only. If either process tries to modify these regions, the kernel then makes a copy of that piece of memory only, typically a "page" in a virtual memory system. Section 9.2 of Bach [1986] and Sections 5.6 and 5.7 of McKusick et al. [1996] provide more detail on this feature.
我們來給出詳細(xì)的注釋
#include
#include
int main(void)
{
pid_t pid;
int count=0;
/*此處,執(zhí)行fork調(diào)用,創(chuàng)建了一個(gè)新的進(jìn)程, 這個(gè)進(jìn)程共享父進(jìn)程的數(shù)據(jù)和堆??臻g等,這之后的代碼指令為子進(jìn)程創(chuàng)建了一個(gè)拷貝。 fock 調(diào)用是一個(gè)復(fù)制進(jìn)程,fock 不象線程需提供一個(gè)函數(shù)做為入口, fock調(diào)用后,新進(jìn)程的入口就在 fock的下一條語句。*/
pid = fork();
/*此處的pid的值,可以說明fork調(diào)用后,目前執(zhí)行的是父進(jìn)程還是子進(jìn)程*/
printf( "Now, the pid returned by calling fork() is %dn", pid );
if ( pid>0 )
{
/*當(dāng)fork在子進(jìn)程中返回后,fork調(diào)用又向父進(jìn)程中返回子進(jìn)程的pid, 如是該段代碼被執(zhí)行,但是注意的事,count仍然為0, 因?yàn)楦高M(jìn)程中的count始終沒有被重新賦值, 這里就可以看出子進(jìn)程的數(shù)據(jù)和堆??臻g和父進(jìn)程是獨(dú)立的,而不是共享數(shù)據(jù)*/
printf( "This is the parent process,the child has the pid:%dn", pid );
printf( "In the parent process,count = %dn", count );
}
else if ( !pid )
{ /*在子進(jìn)程中對(duì)count進(jìn)行自加1的操作,但是并沒有影響到父進(jìn)程中的count值,父進(jìn)程中的count值仍然為0*/
printf( "This is the child process.n");
printf( "Do your own things here.n" );
count++;
printf( "In the child process, count = %dn", count );
}
else
{
printf( "fork failed.n" );
}
return 0;
}
也就是說,在Linux下一個(gè)進(jìn)程在內(nèi)存里有三部分的數(shù)據(jù),就是"代碼段"、"堆棧段"和"數(shù)據(jù)段"。"代碼段",顧名思義,就是存放了程序代碼的數(shù)據(jù),假如機(jī)器中有數(shù)個(gè)進(jìn)程運(yùn)行相同的一個(gè)程序,那么它們就可以使用相同的代碼段。"堆棧段"存放的就是子程序的返回地址、子程序的參數(shù)以及程序的局部變量。而數(shù)據(jù)段則存放程序的全局變量,常數(shù)以及動(dòng)態(tài)數(shù)據(jù)分配的數(shù)據(jù)空間(比如用malloc之類的函數(shù)取得的空間)。系統(tǒng)如果同時(shí)運(yùn)行數(shù)個(gè)相同的程序,它們之間就不能使用同一個(gè)堆棧段和數(shù)據(jù)段。
仔細(xì)分析后,我們就可以知道:
一個(gè)程序一旦調(diào)用fork函數(shù),系統(tǒng)就為一個(gè)新的進(jìn)程準(zhǔn)備了前述三個(gè)段,首先,系統(tǒng)讓新的進(jìn)程與舊的進(jìn)程使用同一個(gè)代碼段,因?yàn)樗鼈兊某绦蜻€是相同的,對(duì)于數(shù)據(jù)段和堆棧段,系統(tǒng)則復(fù)制一份給新的進(jìn)程,這樣,父進(jìn)程的所有數(shù)據(jù)都可以留給子進(jìn)程,但是,子進(jìn)程一旦開始運(yùn)行,雖然它繼承了父進(jìn)程的一切數(shù)據(jù),但實(shí)際上數(shù)據(jù)卻已經(jīng)分開,相互之間不再有影響了,也就是說,它們之間不再共享任何數(shù)據(jù)了。
fork()不僅創(chuàng)建出與父進(jìn)程代碼相同的子進(jìn)程,而且父進(jìn)程在fork執(zhí)行點(diǎn)的所有上下文場(chǎng)景也被自動(dòng)復(fù)制到子進(jìn)程中,包括:
——全局和局部變量
——打開的文件句柄
——共享內(nèi)存、消息等同步對(duì)象
而如果兩個(gè)進(jìn)程要共享什么數(shù)據(jù)的話,就要使用另一套函數(shù)(shmget,shmat,shmdt等)來操作?,F(xiàn)在,已經(jīng)是兩個(gè)進(jìn)程了,對(duì)于父進(jìn)程,fork函數(shù)返回了子程序的進(jìn)程號(hào),而對(duì)于子程序,fork函數(shù)則返回零,這樣,對(duì)于程序,只要判斷fork函數(shù)的返回值,就知道自己是處于父進(jìn)程還是子進(jìn)程中。