-
Hi @rwightman here is the config i am using, following Table 2 from the article:
i am getting 79.6% -+ 0.05% Small differences like this can sometimes come even from different versions of pytroch\NGC (mixed-precision implementation) p.s. |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 4 replies
-
@mrT23 closer matching the paper A2 run add remode doesn't need to be specified if prob is 0 |
Beta Was this translation helpful? Give feedback.
-
Hi, @mrT23 You increased the A2 procedure, from 79.6% to 80.1%? Hope you give me some configs or tricks to reproduce the result. |
Beta Was this translation helpful? Give feedback.
@mrT23 closer matching the paper A2 run add
--bce-target-thresh 0.2 --aug-repeats 3
... my lamb impl works, but you can also use fusedlamb from apex if you're able, most of the experiments were run with fusedlamb for the slight throughput / mem gains for GPUremode doesn't need to be specified if prob is 0