At the risk of clobbering this post, I have started another post:
But yes, the testing flow for an addon has to support regression testing, and in particular has to run against multiple versions of Blender, which makes it a different beast from the testing flow already contained in the Blender project.
Maybe the title of this post is a little too generic and I am misrepresenting the idea of the work.
Please have a look at those, and if you have any observations I would love to hear them (at the link above).