
[–]Smith4242 23 points24 points  (3 children)

Really cool! Would love to see what the GAN produces if it's fed album covers from specific genres!

[–]BostonRich 9 points10 points  (2 children)

Or two genres that are very different, hip hop and country maybe.

[–][deleted] 12 points13 points  (0 children)

Old town road

[–]alexchuck 0 points1 point  (0 children)

Post Malone

[–]crazyyfish 20 points21 points  (0 children)

It's interesting that it even learns some kind of text and its arrangement.

[–]CrazyAsparagus 32 points33 points  (1 child)

A lot of letters are backwards, is that because you trained on augmented data with image flipping or something?

[–]veqtor 13 points14 points  (0 children)

Probably, I did the same thing without flipping and got some pretty reasonable pieces of text

[–][deleted] 12 points13 points  (0 children)

Did you mirror images as part of data augmentation? A lot of the text has a definite backwards vibe to it :)

[–]exilhesse 10 points11 points  (1 child)

It kind of stresses me out that there is text that I am unable to read. Is this what dyslexia feels like?

[–]pataoAoC 5 points6 points  (0 children)

Same... But I suspect this is what English etc looks like when you come from a non-Latin script language (Hindi, Arabic, Chinese etc)

[–]CambrianKid 7 points8 points  (0 children)

Have you uploaded the model anywhere? I'd love to play around with this.

[–]veqtor 8 points9 points  (6 children)

4 days on 1x 2080ti, I've implemented fp16-mixed precision training so it's more like 2x though

https://i.imgur.com/OGcaUKB.jpg

[–]gwern 1 point2 points  (3 children)

How did you implement FP16?

[–]veqtor 2 points3 points  (2 children)

Used some ProGAN code that had been disabled. I also had to perform correct casting in the mapping layers, and of course, whenever you're doing some kind of normalization you probably want to do it in fp32, with a proper epsilon to avoid divide-by-zero. The optimizer supports adaptive loss scaling but I don't remember how I activated it. I'll clean up my code a bit and upload it to GitHub.
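(Not the actual code from the repo, but a minimal NumPy sketch of the pattern described above — the function name and shapes are made up. The idea is to keep activations in fp16 while upcasting to fp32 for the normalization itself:)

```python
import numpy as np

def pixel_norm_fp32(x, epsilon=1e-8):
    # Upcast to fp32: summing squares of fp16 values overflows easily
    # (fp16 max is ~65504), and the epsilon keeps the divide stable.
    x32 = x.astype(np.float32)
    norm = np.sqrt(np.mean(x32 * x32, axis=-1, keepdims=True) + epsilon)
    # Cast back so the rest of the network stays in half precision.
    return (x32 / norm).astype(x.dtype)

acts = np.random.randn(4, 512).astype(np.float16)
out = pixel_norm_fp32(acts)
print(out.dtype, out.shape)  # float16 (4, 512)
```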

[–]toomanyLUNAs 0 points1 point  (1 child)

Still planning to upload your fp16 version? It could be very useful, even if it's messy.

I doubt people will mind, although I do get not wanting to put up code which you know has problems.

[–]veqtor 1 point2 points  (0 children)

Here it is, also contains some stuff I've been experimenting with, but that's disabled and needs to be enabled in networks_stylegan.py

https://github.com/veqtor/stylegan

[–]shoeblade[S] 0 points1 point  (1 child)

Nice, what's the output resolution?

[–]veqtor 0 points1 point  (0 children)

512x512. I grabbed covers from Spotify that were mostly at 640x640.

[–]iluvcoder 7 points8 points  (7 children)

Hi u/shoeblade, looks great! How long (in days) did it take to train?

[–]shoeblade[S] 11 points12 points  (5 children)

~5-6 days on 8x V100

[–]TopsyMitoTurvy 29 points30 points  (4 children)

Cries in AWS credits.

[–]veqtor 15 points16 points  (3 children)

Check my images:
https://i.imgur.com/OGcaUKB.jpg

4 days on 2080ti, with fp16 (mixed precision)

I'm trying to get the most out of a 2080ti and then maybe release this code; hopefully you can get the same results in 2 weeks

[–]scrdest 2 points3 points  (0 children)

Third row, penultimate column is clearly a new version of Potato Jesus.

[–]TopsyMitoTurvy 0 points1 point  (1 child)

Any update on this?

[–]veqtor 0 points1 point  (0 children)

Here it is, also contains some stuff I've been experimenting with, but that's disabled and needs to be enabled in networks_stylegan.py

https://github.com/veqtor/stylegan

[–]JayWalkerC 3 points4 points  (0 children)

I'm also curious what hardware was used.

[–]Veedrac 6 points7 points  (0 children)

The text looks amazing.

[–]ZigguratOfUr 5 points6 points  (0 children)

Looks better aesthetically than any other stylegan I've seen!

[–]lewis841214 2 points3 points  (0 children)

Is it ok to share the code and the trained data?

[–]NicolasGuacamole 10 points11 points  (7 children)

Can you show the closest images in the training set compared to your cherry-picked samples? It would be interesting to see.

[–]shoeblade[S] 1 point2 points  (6 children)

Not sure what you mean exactly, but here's a matrix of random samples from training:

https://imgur.com/UWgaG2D

[–]SimpleProject 24 points25 points  (5 children)

They're asking for the nearest neighbor in the training set for the generated images you posted.

[–]marvpaul 11 points12 points  (4 children)

Just to make sure the GAN has not overfitted I guess :)

[–]NicolasGuacamole 3 points4 points  (3 children)

Pretty much

[–]NotAlphaGo 0 points1 point  (2 children)

Wasn't there an argument somewhere that doing an interpolation would show overfitting?

[–]NicolasGuacamole 1 point2 points  (1 child)

Wouldn’t surprise me. Overfitting is already a hard concept to quantify. In these GANs it’s especially difficult, if nothing else because distances in pixel space might not be perfectly revealing w.r.t. the ‘real’ point that was overfit to.

Anyway I just want to see out of interest, especially if the face image is close to an existing pattern.

[–]NotAlphaGo 1 point2 points  (0 children)

Agreed. Quantifying overfitting is hard in GANs. Even if the GAN did memorise the training set and acted as a smart interpolation function between its samples, wouldn't that still give it some merit? I.e., an overfit GAN doesn't have to be a useless GAN?

[–]nurijanian 1 point2 points  (0 children)

someone should post this in r/oddlyterrifying

[–]gnu-user 0 points1 point  (1 child)

This is very cool! Do you have a project on Github or any code available to demonstrate how this was done? Are there any projects / code in particular for those interested in GANs that you recommend?

[–]veqtor 0 points1 point  (0 children)

Here is the repository:
https://github.com/NVlabs/stylegan

This guide is really good:
https://gwern.net/Faces

[–]tough-dance 0 points1 point  (1 child)

Maybe you can't go into specifics, but what differentiates StyleGAN? Is it just a DCGAN applied to something "styled", or is there some sort of relevant "style structure" (much like a WaveGAN is just a 1-dimensional DCGAN applied to a waveform)?

[–]julvo 4 points5 points  (0 children)

StyleGAN uses a different generator: a fully connected mapping network transforms the random code into parameters for adaptive instance normalisation layers in the synthesis network. Here's the paper: https://arxiv.org/abs/1812.04948

[–]Jonno_FTW 0 points1 point  (0 children)

Can you post the training dataset and/or code or model?

[–]fimari 0 points1 point  (1 child)

I think before doing this we should extract as much meaning as possible, by generating for example a well-formed SVG out of the cover, where text is text with a font linked to it, and a gradient is defined as a gradient.

The next big step - high quality vectorisation.

[–]veqtor 0 points1 point  (0 children)

Actually, it could perhaps work to add a Spatial Transformer Network with OCR to the discriminator, along with some kind of text-to-2D-attention in the generator, to handle the text properly

[–]b_n 0 points1 point  (2 children)

Does it work so well because all the images have the same aspect ratio? I have a database I would like to try this on, but the images are all of varying resolutions and aspect ratios.

[–]gohu_cd 1 point2 points  (0 children)

You can add black margins to your images so that they all fit in a square box while keeping the right aspect ratio. Worked for me :)
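(A hypothetical helper showing that padding trick in NumPy, assuming channels-last uint8 images; the function name is made up:)

```python
import numpy as np

def pad_to_square(img, fill=0):
    # Center the image on a square canvas of `fill`-valued (black)
    # pixels, preserving the aspect ratio instead of stretching.
    h, w = img.shape[:2]
    side = max(h, w)
    canvas = np.full((side, side) + img.shape[2:], fill, dtype=img.dtype)
    top, left = (side - h) // 2, (side - w) // 2
    canvas[top:top + h, left:left + w] = img
    return canvas

wide = np.full((360, 640, 3), 255, dtype=np.uint8)  # a 640x360 image
square = pad_to_square(wide)
print(square.shape)  # (640, 640, 3)
```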

[–]veqtor 0 points1 point  (0 children)

Yes, also, there's probably some common styles of album covers and variations in the dataset.

[–]XmintMusic 0 points1 point  (0 children)

Is the code or training data available somewhere? I'd love to play with these covers.

[–]Beaster123 0 points1 point  (0 children)

The examples look like 90s album covers.

[–]SaveUser 0 points1 point  (0 children)

Do you have your event logfiles / tensorboard graphs?

I was training a similar StyleGAN but ended up with diverging loss for G and D, and a small degree of mode collapse, so I'd be curious to see the stats on yours

[–]mysterEFrank 0 points1 point  (0 children)

It learns mop top haircuts

[–]Kilerpoyo 0 points1 point  (0 children)

Hi, did you use the Nvidia code?