Azure B2C TrustFrameworkLocalization.xml localization tool

Question

I have a TrustFrameworkLocalization.xml file that contains localized strings, for now only English, but the number of strings is quite large. My colleagues on the localization team are not technical people, so they are not able to modify the XML file in the right way. Is there any tool they could use to localize the B2C strings and generate the TrustFrameworkLocalization.xml file? Something like BabelEdit for JSON?

Answer 1

Score: 1

I asked GPT-4 to write a Python script which, for each LocalizedResources tag in the policy, outputs an XLSX file that has the Id of that tag as its name, with the LocalizedString attributes as columns. It only took 1-2 feedback rounds for it to work. I then took the English example and let people copy the sheet, localize the "Value" column, and give the sheet a name like "fr". Or you just copy and paste the whole Value column into DeepL and it will localize the entire column.

I then asked it to write another script which reads in my Excel file, outputs XML that looks the way it is supposed to, and constructs a name for each "LocalizedResources" element from api + filename + sheetname. I showed it what the XML looks like (roughly the shape sketched below) and it worked after 2 tries. I just had to tell it to remove elements where e.g. the StringId is empty. Then I just copy-pasted the whole thing back into the policy.
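For reference, here is a rough, hand-written sketch of the structure involved, with the policy's namespace declarations omitted and made-up Id and StringId values. Each LocalizedString's attributes (ElementType, ElementId, StringId) become spreadsheet columns, its text becomes the Value column, and the Id on LocalizedResources is what the import script reconstructs from api + filename + sheetname:

<LocalizedResources Id="api.signuporsignin.en">
  <LocalizedStrings>
    <LocalizedString ElementType="UxElement" StringId="button_signin">Sign in now</LocalizedString>
    <LocalizedString ElementType="ClaimType" ElementId="email" StringId="DisplayName">Email Address</LocalizedString>
  </LocalizedStrings>
</LocalizedResources>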

This is the import script:

import pandas as pd
import os
import xml.etree.ElementTree as ET
from xml.dom import minidom
import argparse


def prettify(elem):
    """Return a pretty-printed XML string for the Element."""
    rough_string = ET.tostring(elem, 'utf-8')
    reparsed = minidom.parseString(rough_string)
    return reparsed.toprettyxml(indent="  ")


def process_file(file_path):
    # Load the workbook
    workbook = pd.read_excel(file_path, sheet_name=None)

    base_name = os.path.basename(file_path)
    base_name_without_ext = os.path.splitext(base_name)[0]

    # Create root element for combined output
    combined_resources = ET.Element('LocalizedResources')

    # Process each sheet
    for sheet_name, data in workbook.items():
        # Create root element
        resources = ET.Element('LocalizedResources')
        resources.set(
            'Id', f'api.{base_name_without_ext}.{sheet_name.lower()}')

        # Create LocalizedStrings element
        strings = ET.SubElement(resources, 'LocalizedStrings')

        # Process each row
        for i, row in data.iterrows():
            localized_string = ET.SubElement(strings, 'LocalizedString')

            # Add attributes based on row data, convert NaN to ''
            localized_string.set('ElementType', str(
                row['ElementType']) if pd.notna(row['ElementType']) else '')

            if pd.notna(row['ElementId']):
                localized_string.set('ElementId', str(row['ElementId']))

            localized_string.set('StringId', str(
                row['StringId']) if pd.notna(row['StringId']) else '')

            localized_string.text = str(
                row['Value']) if pd.notna(row['Value']) else ''

        # Add processed resources to combined_resources
        combined_resources.append(resources)

    # Write everything to a single combined XML file (UTF-8 so non-ASCII translations are preserved)
    xml_str = prettify(combined_resources)
    with open(f"{base_name_without_ext}_combined.xml", "w", encoding="utf-8") as f:
        f.write(xml_str)


if __name__ == '__main__':
    parser = argparse.ArgumentParser(
        description='Process an XLSX file and generate a combined XML file.')
    parser.add_argument('-f', '--file', required=True,
                        help='Path to the input XLSX file.')
    args = parser.parse_args()
    process_file(args.file)
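A minimal sketch of how you would invoke it, assuming the script is saved as import_localization.py and the workbook is called signuporsignin.xlsx with one sheet per language, e.g. "en" and "fr" (both names are my assumptions; the file and sheet names determine the generated Ids):

python import_localization.py -f signuporsignin.xlsx

This writes signuporsignin_combined.xml in the current directory, containing one LocalizedResources block per sheet with Ids like api.signuporsignin.en and api.signuporsignin.fr, ready to be pasted back into the policy.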

That's the export script:

import xlsxwriter
import argparse
import xml.etree.ElementTree as ET
import pandas as pd

# Define command line arguments
parser = argparse.ArgumentParser()
parser.add_argument("-f", "--file", help="Path to the XML policy file")
parser.add_argument("-m", "--mode", help="Mode: Import or Export (parsed but not used below)")
args = parser.parse_args()

print(f"Parsing file: {args.file}")
tree = ET.parse(args.file)
root = tree.getroot()
localized_resources = root.findall(
    ".//{http://schemas.microsoft.com/online/cpim/schemas/2013/06}LocalizedResources")

# Create a new Excel file with one worksheet per LocalizedResources block.
print("Opening Workbook...")
print(f"Found {len(localized_resources)} localized resources")

writer = pd.ExcelWriter('AzureAD_B2C_Translations.xlsx', engine='xlsxwriter')

for resource in localized_resources:
    resource_id = resource.attrib['Id']
    localized_strings = resource.findall(
        ".//{http://schemas.microsoft.com/online/cpim/schemas/2013/06}LocalizedString")
    print(
        f"Resource {resource.attrib['Id']} has {len(localized_strings)} localized strings...")
    output = []
    for string in localized_strings:
        # Copy the attributes and append the element text as the "Value" column
        items = list(string.items())
        items.append(("Value", string.text))
        output.append(dict(items))
    df = pd.DataFrame(output)

    # The sheet name is the resource Id without the leading "api." prefix
    df.to_excel(writer, sheet_name=resource_id[4:],
                startrow=0, header=True, index=False)

writer.close()
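And a sketch of running the export, assuming the script is saved as export_localization.py (the filename is my assumption) and pointed at the policy file from the question:

python export_localization.py -f TrustFrameworkLocalization.xml

It produces AzureAD_B2C_Translations.xlsx with one worksheet per LocalizedResources block, named after the resource Id without the leading "api."; keep in mind that Excel limits sheet names to 31 characters, so very long Ids would need shortening.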

First I wanted the export script to do both, but I found it easier to just have GPT write two scripts. Having it write the import script literally took 5 minutes with a well-articulated prompt.
