ASCII कला उत्पन्न करें

इनपुट के रूप में किसी भी उचित दोषरहित प्रारूप में एक श्वेत-श्याम छवि को देखते हुए , आउटपुट ASCII कला जो इनपुट छवि के जितना संभव हो उतना करीब है।

नियम

केवल लाइनफीड और ASCII बाइट 32-127 का उपयोग किया जा सकता है।
इनपुट छवि को क्रॉप किया जाएगा ताकि छवि के आसपास कोई बाहरी व्हाट्सएप न हो।
प्रस्तुतियाँ 5 मिनट के भीतर पूरे स्कोरिंग कॉर्पस को पूरा करने में सक्षम होनी चाहिए।
केवल कच्चा पाठ स्वीकार्य है; कोई समृद्ध पाठ प्रारूप नहीं।
स्कोरिंग में उपयोग किया जाने वाला फ़ॉन्ट 20-pt लिनक्स लिबर्टिन है ।
आउटपुट टेक्स्ट फ़ाइल, जब नीचे वर्णित छवि में परिवर्तित हो जाती है, तो इनपुट आयाम के समान आयाम होना चाहिए, दोनों आयामों में 30 पिक्सेल के भीतर।

स्कोरिंग

इन छवियों का उपयोग स्कोरिंग के लिए किया जाएगा:

आप यहाँ छवियों का एक ज़िप डाउनलोड कर सकते हैं ।

प्रस्तुतियाँ इस कॉर्पस के लिए अनुकूलित नहीं होनी चाहिए; बल्कि, उन्हें समान आयामों के किसी भी 8 श्वेत-श्याम चित्रों के लिए काम करना चाहिए। यदि मुझे संदेह है कि इन विशिष्ट चित्रों के लिए सबमिशन को अनुकूलित किया जा रहा है, तो मैं कॉर्पस में छवियों को बदलने का अधिकार सुरक्षित रखता हूं।

स्कोरिंग इस स्क्रिप्ट के माध्यम से किया जाएगा:

#!/usr/bin/env python
from __future__ import print_function
from __future__ import division
# modified from http://stackoverflow.com/a/29775654/2508324
# requires Linux Libertine fonts - get them at https://sourceforge.net/projects/linuxlibertine/files/linuxlibertine/5.3.0/
# requires dssim - get it at https://github.com/pornel/dssim
import PIL
import PIL.Image
import PIL.ImageFont
import PIL.ImageOps
import PIL.ImageDraw
import pathlib
import os
import subprocess
import sys

PIXEL_ON = 0  # PIL color to use for "on"
PIXEL_OFF = 255  # PIL color to use for "off"

def dssim_score(src_path, image_path):
    out = subprocess.check_output(['dssim', src_path, image_path])
    return float(out.split()[0])

def text_image(text_path):
    """Convert text file to a grayscale image with black characters on a white background.

    arguments:
    text_path - the content of this file will be converted to an image
    """
    grayscale = 'L'
    # parse the file into lines
    with open(str(text_path)) as text_file:  # can throw FileNotFoundError
        lines = tuple(l.rstrip() for l in text_file.readlines())

    # choose a font (you can see more detail in my library on github)
    large_font = 20  # get better resolution with larger size
    if os.name == 'posix':
        font_path = '/usr/share/fonts/linux-libertine/LinLibertineO.otf'
    else:
        font_path = 'LinLibertine_DRah.ttf'
    try:
        font = PIL.ImageFont.truetype(font_path, size=large_font)
    except IOError:
        print('Could not use Libertine font, exiting...')
        exit()

    # make the background image based on the combination of font and lines
    pt2px = lambda pt: int(round(pt * 96.0 / 72))  # convert points to pixels
    max_width_line = max(lines, key=lambda s: font.getsize(s)[0])
    # max height is adjusted down because it's too large visually for spacing
    test_string = 'ABCDEFGHIJKLMNOPQRSTUVWXYZ'
    max_height = pt2px(font.getsize(test_string)[1])
    max_width = pt2px(font.getsize(max_width_line)[0])
    height = max_height * len(lines)  # perfect or a little oversized
    width = int(round(max_width + 40))  # a little oversized
    image = PIL.Image.new(grayscale, (width, height), color=PIXEL_OFF)
    draw = PIL.ImageDraw.Draw(image)

    # draw each line of text
    vertical_position = 5
    horizontal_position = 5
    line_spacing = int(round(max_height * 0.8))  # reduced spacing seems better
    for line in lines:
        draw.text((horizontal_position, vertical_position),
                  line, fill=PIXEL_ON, font=font)
        vertical_position += line_spacing
    # crop the text
    c_box = PIL.ImageOps.invert(image).getbbox()
    image = image.crop(c_box)
    return image

if __name__ == '__main__':
    compare_dir = pathlib.PurePath(sys.argv[1])
    corpus_dir = pathlib.PurePath(sys.argv[2])
    images = []
    scores = []
    for txtfile in os.listdir(str(compare_dir)):
        fname = pathlib.PurePath(sys.argv[1]).joinpath(txtfile)
        if fname.suffix != '.txt':
            continue
        imgpath = fname.with_suffix('.png')
        corpname = corpus_dir.joinpath(imgpath.name)
        img = text_image(str(fname))
        corpimg = PIL.Image.open(str(corpname))
        img = img.resize(corpimg.size, PIL.Image.LANCZOS)
        corpimg.close()
        img.save(str(imgpath), 'png')
        img.close()
        images.append(str(imgpath))
        score = dssim_score(str(corpname), str(imgpath))
        print('{}: {}'.format(corpname, score))
        scores.append(score)
    print('Score: {}'.format(sum(scores)/len(scores)))

स्कोरिंग प्रक्रिया:

प्रत्येक कॉर्पस छवि के लिए सबमिशन चलाएँ, परिणाम को .txtफाइल के रूप में एक ही स्टेम के साथ आउटपुट करते हुए कॉर्पस फ़ाइल (मैन्युअल रूप से किया गया)।
20-बिंदु फ़ॉन्ट का उपयोग करके, व्हाट्सएप को क्रॉप करके प्रत्येक टेक्स्ट फ़ाइल को पीएनजी छवि में परिवर्तित करें।
परिणाम छवि को लैंक्ज़ोस रेज़म्पलिंग का उपयोग करके मूल छवि के आयामों का आकार बदलें।
मूल छवि का उपयोग करके प्रत्येक पाठ छवि की तुलना करें dssim।
प्रत्येक पाठ फ़ाइल के लिए dssim स्कोर आउटपुट करें।
औसत स्कोर का आउटपुट।

संरचनात्मक समानता (वह मीट्रिक जिसके द्वारा dssimस्कोर की गणना की जाती है) चित्रों में मानवीय दृष्टि और वस्तु पहचान के आधार पर एक मीट्रिक है। इसे स्पष्ट रूप से कहने के लिए: यदि दो छवियां मनुष्यों के समान दिखती हैं, तो वे (शायद) से कम स्कोर करेंगे dssim।

जीतने वाला सबमिशन सबसे कम औसत स्कोर के साथ जमा होगा।

^{सम्बंधित}

code-challenge ascii-art image-processing

— मै जाऊं
स्रोत

"ब्लैक एंड व्हाइट" के रूप में "शून्य / एक" या कितने ग्रे स्तर?

— लुइस मेन्डो

@DonMuesli 0 और 1.

— Mego

क्या आप स्पष्ट कर सकते हैं कि " .txtफाइलों के परिणामों को आउटपुट करने" से आपका क्या मतलब है ? क्या प्रोग्राम आउटपुट टेक्स्ट को फ़ाइल में पाइप किया जाना चाहिए या हमें सीधे फाइल आउटपुट करना चाहिए?

— DanTheMan

@DANTheMan या तो स्वीकार्य है। यदि आप STDOUT में आउटपुट करते हैं, तो आउटपुट को स्कोरिंग उद्देश्यों के लिए फ़ाइल में पुनर्निर्देशित करना होगा, हालांकि।

— मेगो

आप संकल्प बाधाओं को निर्दिष्ट नहीं करना चाहिए? अन्यथा, हम कह सकते हैं कि, 10000 से 10000 वर्ण छवि का उत्पादन, जब, नीचे, मूल छवियों को काफी बारीकी से मिलाएगा, और व्यक्तिगत वर्ण अवैध डॉट्स होंगे। अगर आउटपुट इमेज बहुत बड़ी है तो फॉन्ट-साइज़ मायने नहीं रखता।

— डेविड

जावा, स्कोर 0.57058675

यह वास्तव में मेरी पहली बार छवि हेरफेर कर रहा है इसलिए यह अजीब है लेकिन मुझे लगता है कि यह ठीक निकला।

मुझे अपनी मशीन पर काम करने के लिए dssim नहीं मिला, लेकिन मैं PIL का उपयोग करके चित्र बनाने में सक्षम था।

दिलचस्प बात यह है कि फॉन्ट मुझे जावा में बताता है कि मेरे द्वारा उपयोग किए जा रहे प्रत्येक वर्ण की चौड़ाई है 6। आप देख सकते हैं कि मेरा कार्यक्रम FontMetrics::charWidthउन 6सभी पात्रों के लिए है जिनका मैंने उपयोग किया है। {}लोगो एक monospace फ़ॉन्ट में बहुत सभ्य लग रहा है। लेकिन किसी कारण के लिए लाइनें वास्तव में पूर्ण पाठ फ़ाइल में पंक्तिबद्ध नहीं होती हैं। मैं लिगमेंट को दोष देता हूं। (और हाँ, मुझे सही फ़ॉन्ट का उपयोग करना चाहिए।)

मोनोपॉज़्ड फ़ॉन्ट में:

                                                                                      .
                         .,:ff:,                                                   ,:fff::,.
                ,ff .fIIIIIf,                                                         .:fIIIIIf.:f:.
            .,:III: ,ff::                       ..,,            ,,..                      ,:fff, IIII.,
          :IIf,f:,:fff:,                  .:fIIIIIII.          .IIIIIIIf:.                 .,:fff:,ff IIf,
       ,.fIIIf,:ffff,                   ,IIIIIII:,,.            .,,:IIIIIII.                  .:ffff:,IIII,:.
     ,III.::.,,,,,.                     IIIIII:                      ,IIIIII                     ,,,,,.,:,:IIf
     IIIII :ffIIf,                      IIIIII,                      .IIIIII                      :IIIf:,.IIIIf.
  ,II,fIf.:::,..                        IIIIII,                      .IIIIII                       ..,:::,,If::II
  IIIIf.  ,:fII:                       .IIIIII,                      .IIIIII.                       IIff:.  :IIII:
 ::IIIIf:IIIf: .                  ,::fIIIIIII,                        ,fIIIIIIf::,                   ,ffIII,IIIIf,,
:IIf:::    .,fI:                  IIIIIIIII:                            :IIIIIIIIf                  If:,    .::fIIf
 IIIIII, :IIIIf                     .,:IIIIIIf                        fIIIIII:,.                    ,IIIII. fIIIII:
 ,:IIIII ff:,   f,                      IIIIII,                      .IIIIII                      f.  .::f::IIIIf,.
 fIf::,,     ,fIII                      IIIIII,                      .IIIIII                     :III:      ,,:fII.
  fIIIIIIf, :IIIIf   ,                  IIIIII,                      .IIIIII                 .,  ,IIIII. :fIIIIII,
   .:IIIIIII,ff,    :II:                IIIIIIf                      fIIIIII               .fII.   .:ff:IIIIIIf,
     :fffff:,      IIIIIf   ,            :IIIIIIIfff            fffIIIIIII:           ..   IIIII:      ::fffff,
      .fIIIIIIIf:, fIIII,   ,IIf,           ,:ffIIII.          .IIIIff:,          .:fII    fIIII,.:ffIIIIIII:
         ,fIIIIIIIIIf:,     ,IIIII:  .,::,                               .,::,  .IIIIII      ::fIIIIIIIIf:.
             :fffffff,      .fIIIII,   .IIIIIf:                     ,:fIIII:    IIIIII:       :fffffff,
              .:fIIIIIIIIIIIIffffI:      IIIIIIII.                :IIIIIII:     .fIffffIIIIIIIIIIII:,
                   ,:fIIIIIIIIIIIf,       .:fIIIII               ,IIIIIf,        :IIIIIIIIIIIff,.
                         .:ffffffffIIIIIIIIIIIfff:.              ,ffffIIIIIIIIIIIfffffff:,
                             .,:ffIIIIIIIIIIIIIIIIf,   .,,,,.  .:fIIIIIIIIIIIIIIIIff:,.
                                       ....... .,,:fffff:.,:fffff:,.  .......
                                    ..,,:fffIIIIf:,.            .,:fIIIIff::,,..
                                   .IIIIIf:,.                          .,:fIIIII
                                     f,                                      ,f

छवि उपकरण के माध्यम से इसे चलाने के बाद:

वैसे भी, यहाँ वास्तविक कोड है।

//package cad97;

import java.awt.Font;
import java.awt.FontMetrics;
import java.awt.Rectangle;
import java.awt.Toolkit;
import java.awt.image.BufferedImage;
import java.awt.image.Raster;
import java.io.File;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.io.InputStream;
import java.util.HashMap;
import java.util.Map;
import javax.imageio.ImageIO;

public final class AsciiArt {

    private static final Font LINUX_LIBERTINE = new Font("LinLibertine_DRah", Font.PLAIN, 20);
    private static final FontMetrics LL_METRICS = Toolkit.getDefaultToolkit().getFontMetrics(LINUX_LIBERTINE);
    // Toolkit::getFontMetrics is deprecated, but that's the only way to get FontMetrics without an explicit Graphics environment.
    // If there's a better way to get the widths of characters, please tell me.

    public static void main(String[] args) throws IOException {
        File jar = new java.io.File(AsciiArt.class.getProtectionDomain().getCodeSource().getLocation().getPath());
        if (args.length != 1) {
            String jarName = jar.getName();
            System.out.println("Usage: java -jar " + jarName + " file");
        } else {
            File image = new File(args[0]);
            try (InputStream input = new FileInputStream(image)) {
                String art = createAsciiArt(ImageIO.read(input), LINUX_LIBERTINE, LL_METRICS);
                System.out.print(art); // If you want to save as a file, change this.
            } catch (FileNotFoundException fnfe) {
                System.out.println("Unable to find file " + image + ".");
                System.out.println("Please note that you need to pass the full file path.");
            }
        }
    }

    private static String createAsciiArt(BufferedImage image, Font font, FontMetrics metrics) {
        final int height = metrics.getHeight();
        final Map<Character,Integer> width = new HashMap<>();
        for (char c=32; c<127; c++) { width.put(c, metrics.charWidth(c)); }

        StringBuilder art = new StringBuilder();

        for (int i=0; i<=image.getHeight(); i+=height) {
            final int tempHeight = Math.min(height, image.getHeight()-i);
            art.append(createAsciiLine(image.getSubimage(0, i, image.getWidth(), tempHeight), width));
        }

        return art.toString();
    }

    private static String createAsciiLine(BufferedImage image, Map<Character,Integer> charWidth) {
        if (image.getWidth()<6) return "\n";
        /*
        I'm passing in the charWidth Map because I could use it, and probably a later revision if I
        come back to this will actually use non-6-pixel-wide characters. As is, I'm only using the
        6-pixel-wide characters for simplicity. They are those in this set: { !,./:;I[\]ft|}
        */
        assert charWidth.get(' ') == 6; assert charWidth.get('!') == 6;
        assert charWidth.get(',') == 6; assert charWidth.get('.') == 6;
        assert charWidth.get('/') == 6; assert charWidth.get(':') == 6;
        assert charWidth.get(';') == 6; assert charWidth.get('I') == 6;
        assert charWidth.get('[') == 6; assert charWidth.get('\\') == 6;
        assert charWidth.get(']') == 6; assert charWidth.get('f') == 6;
        assert charWidth.get('t') == 6; assert charWidth.get('|') == 6;

        // Measure whiteness of 6-pixel-wide sample
        Raster sample = image.getData(new Rectangle(6, image.getHeight()));
        int whiteCount = 0;
        for (int x=sample.getMinX(); x<sample.getMinX()+sample.getWidth(); x++) {
            for (int y=sample.getMinY(); y<sample.getMinY()+sample.getHeight(); y++) {
                int pixel = sample.getPixel(x, y, new int[1])[0];
                whiteCount += pixel==1?0:1;
            }
        }

        char next;

        int area = sample.getWidth()*sample.getHeight();

        if (whiteCount > area*0.9) {
            next = ' ';
        } else if (whiteCount > area*0.8) {
            next = '.';
        } else if (whiteCount > area*0.65) {
            next = ',';
        } else if (whiteCount > area*0.5) {
            next = ':';
        } else if (whiteCount > area*0.3) {
            next = 'f';
        } else {
            next = 'I';
        }

        return next + createAsciiLine(image.getSubimage(charWidth.get(','), 0, image.getWidth()-sample.getWidth(), image.getHeight()), charWidth);
    }

}

संकलित करें:

सुनिश्चित करें कि आपके पास JDK स्थापित है
सुनिश्चित करें कि JDK बिन आपके PATH पर है (मेरे लिए C:\Program Files\Java\jdk1.8.0_91\bin)
फ़ाइल को इस रूप में सहेजें AsciiArt.java
javac AsciiArt.java
jar cvfe WhateverNameYouWant.jar AsciiArt AsciiArt.class

उपयोग: java -jar WhateverNameYouWant.jar C:\full\file\path.pngप्रिंट करने के लिए STDOUT

स्रोत फ़ाइल को 1-बिट गहराई से सहेजने के लिए और एक सफेद पिक्सेल के लिए नमूना होने की आवश्यकता है 1।

स्कोरिंग उत्पादन:

corp/board.png: 0.6384
corp/Doppelspalt.png: 0.605746
corp/down.png: 1.012326
corp/img2.png: 0.528794
corp/pcgm.png: 0.243618
corp/peng.png: 0.440982
corp/phi.png: 0.929552
corp/text2image.png: 0.165276
Score: 0.57058675

— CAD97
स्रोत

मुखरता -eaको सक्षम करने के लिए चलाएँ । यह व्यवहार को नहीं बदलेगा (शायद इसे थोड़ी मात्रा में धीमा कर दें) क्योंकि जब वे मूल्यांकन करते हैं falseऔर ये सभी दावे पास हो जाते हैं तो कार्यक्रम को विफल करके काम करते हैं।

— CAD97

आह, मैंने याद किया कि आपने पैकेज घोषणा को हटा दिया है। यह अब काम करता है। आज कुछ मिनट मिलने पर मैं इसे स्कोर करूँगा।

— Mego

Board.png के लिए आउटपुट किसी कारण से केवल 4 लाइनें लंबी है: gist.github.com/Mego/75eccefe555a81bde6022d7eade1424f । वास्तव में, सभी आउटपुट पीपीसीजी लोगो के अपवाद के साथ, जब मैं इसे चलाता हूं तो समय से पहले छंटनी होने लगती है।

— Mego

@ मुझे लगता है कि यह फ़ॉन्ट की ऊंचाई (फॉन्टमेट्रिक्स रिपोर्ट द्वारा 24 px) के साथ करना है। मैंने लाइन लूप को बदल दिया है इसलिए यह एक बहुत कम लाइनों के बजाय एक बहुत कम लाइनों की ओर है, और इसे अब काम करना चाहिए। (बोर्ड में 5 लाइनें हैं)

— CAD97

एक नियम के रूप में यह एल्गोरिथ्म छोटी छवियों के साथ संघर्ष करता है, क्योंकि (यह सोचता है) सभी वर्ण 6px चौड़े और 24px लंबे हैं, और यह सब दिखता है कि उस सुपर-पिक्सेल में कितने पिक्सेल चालू हैं।

— सीएड 97