January 6, 2020 14:43:29

Format subject email

Question

I want to retrieve the "Subject" value of an email:

import imaplib
import os
import email

email_user = 'xxxxxxx@xxxxxxx'
email_pass = 'xxxxxxxx'

M = imaplib.IMAP4_SSL('imap.gmail.com', 993)
M.login(email_user, email_pass)
M.select('INBOX')

typ, message_numbers = M.search(None, 'ALL')

num = b'2420'
typ, data = M.fetch(num, '(RFC822)')

raw_email = data[0][1].decode('utf-8')
email_message = email.message_from_string(raw_email)

print(email_message['Subject'])

The value is:

=?UTF-8?Q?=5BNAS=5FLEBARS=5D_Active_Backup_for_Business_=2D_La_t=C3=A2che_?=
=?UTF-8?Q?de_sauvegarde_DBS_=2D_SIDEXIS_sur_NAS=5FLEBARS_est_termin=C3=A9e?=

But I want it decoded like this:

[NAS_LEBARS] Active Backup for Business - La tâche de sauvegarde DBS - SIDEXIS sur NAS_LEBARS est terminée

Thanks

Answer 1

Score: -1

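This does essentially what you need for this subject. It handles only UTF-8, Q-encoded words (not Base64 "B" encoding or other charsets). It:

1. takes each encoded line
2. extracts the part that needs to be decoded
3. converts that part by
   - replacing each '_' with a space
   - replacing each '=XX' with the byte that has that hex code
   - leaving all other characters as they are
4. decodes the resulting bytes as UTF-8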

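import re

# The two encoded-word lines of the Subject header shown in the question.
subject = [
    '=?UTF-8?Q?=5BNAS=5FLEBARS=5D_Active_Backup_for_Business_=2D_La_t=C3=A2che_?=',
    '=?UTF-8?Q?de_sauvegarde_DBS_=2D_SIDEXIS_sur_NAS=5FLEBARS_est_termin=C3=A9e?='
]


def convert_content(content):
    """Yield the raw bytes represented by a Q-encoded payload."""
    iter_content = iter(content)
    try:
        while True:
            ch = next(iter_content)
            if ch == '_':
                # In Q-encoding an underscore stands for a space.
                yield b' '
            elif ch == '=':
                # '=XX' is a single byte written as two hex digits.
                yield bytearray.fromhex(next(iter_content)+next(iter_content))
            else:
                # Every other character is taken literally.
                yield ch.encode('utf-8')
    except StopIteration:
        # The payload is exhausted.
        pass


def process(data):
    """Decode each '=?UTF-8?Q?...?=' line to text."""
    for line in data:
        # Capture the payload between '=?UTF-8?Q?' and the closing '?='.
        m = re.match(r'=\?(?:utf|UTF)-8\?(?:q|Q)\?(.*)\?=', line)
        yield b''.join(convert_content(m.group(1))).decode('utf-8')


print(''.join(process(subject)))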

Output:

[NAS_LEBARS] Active Backup for Business - La tâche de sauvegarde DBS - SIDEXIS sur NAS_LEBARS est terminée
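
For comparison, the manual decoding above can probably be delegated to the standard library's email.header module, which also covers Base64 ("B") encoded words and other charsets. The following is only a sketch: the helper name decode_subject and the constant RAW_SUBJECT are made up for illustration, and the raw value is assumed to be the two encoded words from the question joined by a single space.

```python
from email.header import decode_header, make_header

# Assumption: the raw Subject value from the question, with the two
# encoded words separated by a single space instead of a folded line.
RAW_SUBJECT = (
    "=?UTF-8?Q?=5BNAS=5FLEBARS=5D_Active_Backup_for_Business_=2D_La_t=C3=A2che_?= "
    "=?UTF-8?Q?de_sauvegarde_DBS_=2D_SIDEXIS_sur_NAS=5FLEBARS_est_termin=C3=A9e?="
)


def decode_subject(raw_value):
    """Hypothetical helper: turn an RFC 2047 encoded header into plain text."""
    # decode_header() splits the value into (bytes, charset) pairs;
    # make_header() rebuilds a Header object whose str() is the decoded text.
    return str(make_header(decode_header(raw_value)))


if __name__ == "__main__":
    print(decode_subject(RAW_SUBJECT))
```

In the question's script this would be called as decode_subject(email_message['Subject']), and it should print the same text as the output above.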
