In fact, $\psi_k(y_i)$ does not need to be a Gaussian density. It could instead be a B-spline with local support. In the limiting case of zeroth- and first-degree B-splines, maximising this cost function is equivalent to generating a simple histogram of the data. E has a simple relationship with the entropy: divided by the number of data points, it is the sample estimate of the entropy of the data under the fitted density.
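The histogram equivalence can be checked numerically for the zeroth-degree case. The following is a minimal sketch, not the author's implementation. It assumes box-shaped (degree-0 B-spline) basis functions on a uniform grid, data that all fall inside the grid, and at least one data point in every bin. The names `box_basis`, `histogram_weights`, `neg_log_lik`, `lo`, and `width` are illustrative only.

```python
import math

def box_basis(k, y, lo, width):
    """Degree-0 B-spline on bin k, scaled to integrate to one."""
    left = lo + k * width
    return 1.0 / width if left <= y < left + width else 0.0

def histogram_weights(ys, lo, width, n_bins):
    """Normalised histogram counts (assumes every y lies inside the grid)."""
    counts = [0] * n_bins
    for y in ys:
        counts[int((y - lo) // width)] += 1
    return [c / len(ys) for c in counts]

def neg_log_lik(ys, weights, lo, width):
    """E = -sum_i log( sum_k w_k * basis_k(y_i) ); needs nonzero weight at each y."""
    total = 0.0
    for y in ys:
        density = sum(w * box_basis(k, y, lo, width)
                      for k, w in enumerate(weights))
        total -= math.log(density)
    return total

def histogram_entropy(weights):
    """Shannon entropy (in nats) of the bin probabilities."""
    return -sum(w * math.log(w) for w in weights if w > 0.0)
```

Under these assumptions, the weights that minimise E are the normalised bin counts, and at that optimum E equals N times (histogram entropy plus the log of the bin width), where N is the number of data points. Any other set of weights summing to one gives a larger E.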
The bias correction model will also fit within this framework:
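$$E = -\log\{P(\mathbf{y} \mid \boldsymbol{\gamma}, \boldsymbol{\theta})\} = -\sum_i \log\Big\{\rho_i(\boldsymbol{\theta}) \sum_k \gamma_k \, \psi_k\big(\rho_i(\boldsymbol{\theta})\, y_i\big)\Big\}$$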
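As a rough illustration of how this objective could be evaluated, the sketch below sums the negative log of a bias-scaled mixture density over the data. It is written under stated assumptions rather than taken from the source: the component densities are Gaussian, the bias field is supplied directly as one multiplicative factor per data point (rather than being generated from its parameters), and all function and argument names are hypothetical.

```python
import math

def gaussian(x, mu, sigma):
    """Univariate Gaussian density (assumed component form)."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def bias_corrected_nll(ys, bias, weights, mus, sigmas):
    """Negative log-likelihood of a mixture evaluated at bias-scaled data.

    ys      : observed intensities
    bias    : one multiplicative bias factor per observation (assumed given)
    weights : mixing proportions, one per component
    mus, sigmas : Gaussian means and standard deviations per component
    """
    total = 0.0
    for y, b in zip(ys, bias):
        mixture = sum(w * gaussian(b * y, m, s)
                      for w, m, s in zip(weights, mus, sigmas))
        # The leading factor b is the Jacobian of the rescaling y -> b * y,
        # which keeps the density over the observed y normalised.
        total -= math.log(b * mixture)
    return total
```

Setting every bias factor to one recovers the ordinary mixture-model objective, which is one way to see that the bias correction fits within the same framework.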