百度智能云

All Product Document

          Cloud Monitor

          Qianfan Large Model Platform ModelBuilder

          Qianfan Large Model Platform ModelBuilder includes two types of monitor objects: preset services (System) and custom services (Custom). The list of monitor metrics for instance monitor is as follows:

          Pre-configured services (System)

          Metric name (English) Metric name (Chinese) Unit Dimension Remarks
          SystemInternalErrorCode System internal error Time error_code,uri,app_id
          AuthErrorCode Authentication error Time error_code,uri,app_id
          UserInputErrorCode User input error Time error_code,uri,app_id
          QuotaExceededErrorCode Quota over-limit error Time error_code,uri,app_id
          PluginErrorCode Plugin error Time error_code,uri,app_id
          TokenizerErrorCode Tokenizer error Time error_code,uri,app_id
          ImageTextErrorCode Image-text related error Time error_code,uri,app_id
          ServiceErrorCode Other service errors Time error_code,uri,app_id
          PromptOptimizationErrorCode Prompt optimization service error Time error_code,uri,app_id
          TPMRateLimit TPM limit TPM uri,app_id
          TPM TPM TPM uri,app_id
          AvailableTPM TPM margin TPM uri,app_id
          RPMRateLimit RPM limit RPM uri,app_id
          RPM RPM RPM uri,app_id
          AvailableRPM RPM margin RPM uri,app_id
          QPS QPS QPS uri,app_id
          TimeToFirstTokenAVG Average first token latency ms uri,app_id
          LatencyAVG Average full sentence latency ms uri,app_id

          Custom services (Custom)

          Metric name (English) Metric name (Chinese) Unit Dimension Remarks
          SystemInternalErrorCode System internal error Time error_code,uri,app_id
          AuthErrorCode Authentication error Time error_code,uri,app_id
          UserInputErrorCode User input error Time error_code,uri,app_id
          QuotaExceededErrorCode Quota over-limit error Time error_code,uri,app_id
          PluginErrorCode Plugin error Time error_code,uri,app_id
          TokenizerErrorCode Tokenizer error Time error_code,uri,app_id
          ImageTextErrorCode Image-text related error Time error_code,uri,app_id
          ServiceErrorCode Other service errors Time error_code,uri,app_id
          PromptOptimizationErrorCode Prompt optimization service error Time error_code,uri,app_id
          TPMRateLimit TPM limit TPM uri,app_id
          TPM TPM TPM uri,app_id
          AvailableTPM TPM margin TPM uri,app_id
          RPMRateLimit RPM limit RPM uri,app_id
          RPM RPM RPM uri,app_id
          AvailableRPM RPM margin RPM uri,app_id
          QPS QPS QPS uri,app_id
          TimeToFirstTokenAVG Average first token latency ms uri,app_id
          LatencyAVG Average full sentence latency ms uri,app_id
          Previous
          Intelligent Big Data
          Next
          Network