Interesting results for Highway Nets on ImageNet with A2 #1197
Unanswered
flukeskywalker
asked this question in
Show and tell
Replies: 1 comment 2 replies
-
@flukeskywalker is there any code to look at for the model changes in the highway variant? I assume it is based on the ResNet models for depths and stages, but with the widths above and 'highway network' style gating? It'd be of interest to include, especially if the runtime (throughput, memory) is competitive with ResNet and similar models.
-
Thanks to timm and @rwightman's very nicely written paper on training recipes, I was able to directly compare Residual connections to Highways on ImageNet-val over the last couple of weeks, and got the results below. I find it interesting that just switching to Highway boosts top-1 accuracy by 0.5% in each case, while the learned representations are smaller. Moreover, with Highway+A2 I get equal or better accuracy than ResNet+A1. I'm curious to know what others think. Are these results interesting, unexpected, or useful to you?
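For readers less familiar with the distinction, here is a minimal sketch of how a Highway connection differs from a Residual one. This is not the code used for the results in this thread. It assumes the common coupled-gate form, y = T(x) * H(x) + (1 - T(x)) * x, and the helper names (`residual_combine`, `highway_combine`) are hypothetical. In a real model, `h` and `gate_logits` would come from convolutional branches rather than hand-written lists.

```python
import math


def sigmoid(z):
    """Logistic function, used here as the transform gate nonlinearity."""
    return 1.0 / (1.0 + math.exp(-z))


def residual_combine(x, h):
    """Residual connection: y = x + H(x), applied elementwise."""
    return [xi + hi for xi, hi in zip(x, h)]


def highway_combine(x, h, gate_logits):
    """Highway connection with a coupled gate (an assumption of this sketch):
    y = T * H(x) + (1 - T) * x, where T = sigmoid(gate_logits)."""
    out = []
    for xi, hi, gi in zip(x, h, gate_logits):
        t = sigmoid(gi)
        out.append(t * hi + (1.0 - t) * xi)
    return out


if __name__ == "__main__":
    x = [1.0, -2.0, 0.5]   # block input
    h = [0.3, 0.1, -0.4]   # stand-in for the transformed features H(x)
    print(residual_combine(x, h))
    # Very negative gate logits close the gate: output stays near x.
    print(highway_combine(x, h, [-20.0] * 3))
    # Very positive gate logits open the gate: output moves to H(x).
    print(highway_combine(x, h, [20.0] * 3))
```

The practical difference is that the Highway output is a learned, per-element interpolation between the input and the transformed features, whereas the Residual output always adds the two. The gate branch adds parameters and compute, which is why throughput and memory are worth measuring.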
top-1 accuracy