Október 28, 2009

Odosielanie mailu s diakritikou z Pythonu

Jaj. To som zase mal veľkú radosť, keď som zistil, že TurboMail má problémy s odosielaním mailov s diakritikou. Ja chápem, že ľudí v anglofónnych krajinách príliš netrápia nejaké tie srandovné veci nad písmenkami. Bez diakritiky je aplikácia pre Európu bezcenná.

Tak, dosť lamentu. Ako poslať z Pythonu mail v UTF-8?

Tu je kód, ktorý som prevzal od Mariusa Gedminasa a rozšíril som ho o pár riadkov, aby bol priamo spustiteľný.

# -*- coding: utf-8 -*-
from smtplib import SMTP
from email.MIMEText import MIMEText
from email.Header import Header
from email.Utils import parseaddr, formataddr

def send_email(sender, recipient, subject, body):
    """Send an email. All arguments should be Unicode strings (plain ASCII works as well). Only the real name part of sender and recipient addresses may contain non-ASCII characters. The email will be properly MIME encoded and delivered though SMTP to localhost port 25. This is easy to change if you want something different. The charset of the email will be the first one out of US-ASCII, ISO-8859-1 and UTF-8 that can represent all the characters occurring in the email. """

    # Header class is smart enough to try US-ASCII, then the charset we
    # provide, then fall back to UTF-8.
    header_charset = 'ISO-8859-1'

    # We must choose the body charset manually
    for body_charset in 'US-ASCII', 'ISO-8859-1', 'UTF-8':
        try:
            body.encode(body_charset)
        except UnicodeError:
            pass
        else:
            break

    # Split real name (which is optional) and email address parts
    sender_name, sender_addr = parseaddr(sender)
    recipient_name, recipient_addr = parseaddr(recipient)

    # We must always pass Unicode strings to Header, otherwise it will
    # use RFC 2047 encoding even on plain ASCII strings.
    sender_name = str(Header(unicode(sender_name), header_charset))
    recipient_name = str(Header(unicode(recipient_name), header_charset))

    # Make sure email addresses do not contain non-ASCII characters
    sender_addr = sender_addr.encode('ascii')
    recipient_addr = recipient_addr.encode('ascii')

    # Create the message ('plain' stands for Content-Type: text/plain)
    msg = MIMEText(body.encode(body_charset), 'plain', body_charset)
    msg['From'] = formataddr((sender_name, sender_addr))
    msg['To'] = formataddr((recipient_name, recipient_addr))
    msg['Subject'] = Header(unicode(subject), header_charset)

    # Send the message via SMTP to localhost:25
    smtp = SMTP("localhost")
    smtp.sendmail(sender, recipient, msg.as_string())
    smtp.quit()

send_email('odosielatel@sinusgear.com',
 'prijemca@sinusgear.com',
 u'Guľôčka v jamôčke',
 u"""Lečo, haruľa, šašlík.
A ako dopadne odosielanie ďalšieho riadku? Bude na novom riadku.
A čo tak odoslať trošku dlhší reťazec znakov, než obyčajných 80, aby sme videli, ako si s tým mailer programy poradia. """)

Všetko. Stačí uložiť do send.py a zavolať: python send.py

Tisíce ďakovných listov od spokojných Pythonierov a neskonalá vďaka Mariusovi.

  • Preklady

  • Odkazy

  • Twitter

    Follow @jurajmichalek on twitter.

  • Štítky

  • Rubriky

  • Komentáre