ArnoF
26th June 2007, 16:31
Hi everybody!
I am just implementing PAFF support into a H.264 decoder as my Master's thesis (no, sorry, it's not x264), but I am having trouble understanding how the inter prediction works when fields and frames are mixed in a video stream. Judging from the DPB code in the JM reference decoder, it seems like frames are split into its two fields so that these can be used later for decoding frames and consequently a complementary field pair is combined into a full frame so that this can be used for decoding frames later.
However, looking at the standard in chapter 8.4.1, this assumption seems to be wrong as it talks about Frm_To_Fld modes and such, so that apparently a single fields is still used when these are referenced from a frame. I do not fully understand how this works, as obviously fields only have half the vertical resolution as frames.
Could somebody clarify how this works? It would help me understanding the standard a lot more.
Thank you for your time!
Regards
Arno
I am just implementing PAFF support into a H.264 decoder as my Master's thesis (no, sorry, it's not x264), but I am having trouble understanding how the inter prediction works when fields and frames are mixed in a video stream. Judging from the DPB code in the JM reference decoder, it seems like frames are split into its two fields so that these can be used later for decoding frames and consequently a complementary field pair is combined into a full frame so that this can be used for decoding frames later.
However, looking at the standard in chapter 8.4.1, this assumption seems to be wrong as it talks about Frm_To_Fld modes and such, so that apparently a single fields is still used when these are referenced from a frame. I do not fully understand how this works, as obviously fields only have half the vertical resolution as frames.
Could somebody clarify how this works? It would help me understanding the standard a lot more.
Thank you for your time!
Regards
Arno