Looking for crosswords dataset

Hello all,

I'm writing a crossword generator and the generated crosswords are not looking very natural compared to real ones. I am considering using machine learning to learn how to select the position each new word should be put on the grid.

I have two issues:

I can't think of a generic machine learning technique that could be used. This is quite all right though, I think I can get away with using a novel stochastic model to describe the process and fit the data to it with a genetic algorithm (or some other metaheuristic technique).
I can't seem to find a dataset to work with. The only datasets I have found on the Internet are ones with lists of words. For my needs, I require a dataset showing the start and end positions of words on the grid, I don't even really need it to contain the words themselves. Does anyone know if such a dataset exist and if so how one would go about getting their hands on it?

Thanks you in advance!

submitted by Naurgul
[link][4 comments]

Looking for crosswords dataset

Trending Articles

Practice Sheet of Right form of verbs for HSC Students

NCERT Solutions for Class 9th Sanskrit Chapter 2 अविवेकः परमापदां पदम्

pinout ecu b5vf 18881a

Stories • Goddess Stepmom

BQ40Z80EVM-020: Installation problems with Battery Management Studio Software...

Cops bust UVF goon Matthews at east Belfast gym

* Start SLD Registration * Failed to open HTTP connection

Practical Research 2 DLP for SHS

South Sudan: CCM VACANCY FOR Primary Health Care Supervisor (PHCS) – SOUTH SUDAN

Sarah Samis, Emil Bove III

VMOU RSCIT Result 2017, RSCIT Result VMOU rkcl.vmou.ac.in Name Wise

IP400 Series Phones Fail to Connect to CAS

Who's been in the courts?

LSI SMIS на ESXi 6.7

MDG F: Cost Centre Hierarchy - File upload

FUNG: ROMELIA MARIA

あいみょん (Aimyong) –瞬間的シックスセンス [FLAC 24bit/48kHz]

Error when updating pager_heading in Views Module - "A valid cache entry...

Re: No option for 'Guest Isolation' in VMware Workstation 16 player

Burbank Police Log: May 16 – May 22