Python code works as below for pronunciation score. I just want to know your scoring routines works like below (no need to be exactly the same). This way I will not required to port all python code (or even a single line). It will save a lot of my time.
Python Scoring Routines Text Dependent:
This method is based on exemplars for each phrase. Initially, mean acoustic score, mean duration along with deviations are calculated for each of the phone in the phrase based on exemplar recordings (acoustic scores for each phoneme from audio file can be calculated using pocketsphinx). Now, given the test recording, each phone in the phrase is then compared with exemplar statistics. After that, z-scores are calculated and then normalized scores are calculated based on maximum and minimum of z-scores from exemplar recordings. All phone scores are aggregated to get word score and then all word scores are aggregated with POS weight to get complete phrase score.
You can find more details here: