relevel vs relevel_by_frequency
I am using H2O's Python API to construct a GLM. To set the reference level for a categorical input I use
relevel(). Based on the documentation it would seem as though
relevel_by_frequency() would accomplish the same thing assuming I want the most frequency level as my base. What I find strange is that the GLM coefficient estimates are different depending on if I use
relevel('most frequency category') or
relevel_by_frequency(). Is this behavior expected/understandable?