In recent years, the research of face age features has achieved rapid development driven by deep learning. The faces generated by the Conditional Adversarial Auto-encoder (CAAE) model are not only highly credible, but also closer to the target age. However, there are many problems, such as low resolution of human face image generation and poor local feature retention effect of human face features. To this end, this paper improves on the CAAE network. Firstly, referring to the LSGAN network structure, the 4 convolution layers of the encoder are added to 5 layers and the 4 convolution layers of the generator are added to 7 layers. Secondly, on the basis of the original loss function, the image gradient difference loss function is added to ensure the output face image quality. Meanwhile, the data set were preprocessed for face correction. Finally, this paper performs face similarity analysis on the Eye-key platform and contrasts the generated image quality using structural similarity and peak signal to noise ratio metrics. In addition, the generated results were tested for their robustness. The experimental results show that the average similarity of faces generated by the Improved Conditional Adversarial Auto-encoder (I-CAAE) network was increased by 3.9. And the average peak signal to noise ratio of the generated pictures was reduced by 1.8. Confirming the superiority of the proposed method.