doc/graph_io.dox
author hegyi
Wed, 27 Jul 2005 11:19:35 +0000
namespace lemon {
/*!

\page graph-io-page Graph Input-Output

The standard graph IO enables one to store graphs and additional maps
(i.e. functions on the nodes or edges) in a flexible and efficient way.
Before you read this page you should be familiar with LEMON
\ref graphs "graphs" and \ref maps-page "maps".

\section format The general file format

The file contains sections in the following order:

\li nodeset
\li edgeset
\li nodes
\li edges
\li attributes

Some of these sections can be omitted, but you will usually need the nodeset
section (unless your graph has no nodes at all) and the edgeset section
(unless your graph has no edges at all).

The nodeset section describes the nodes of your graph: it identifies the nodes
and gives the maps defined on them, if any. It starts with the
following line:

<tt>\@nodeset</tt>

The next line contains the names of the node maps, separated by whitespace. Each
following line describes a node in the graph: it contains the values of the
maps in the same order as the names. The map named "id" must contain unique
values because it is regarded as an ID map. These IDs need not be numbers, but
they must identify the nodes uniquely for later reference. For example:

\code
@nodeset
id  x-coord  y-coord  color
3   1.0      4.0      blue
5   2.3      5.7      red
12  7.8      2.3      green
\endcode
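
To make the layout concrete, here is a minimal standalone sketch that parses
the body of a nodeset section with the C++ standard library only. It is not
the LEMON reader: \c parseNodeSet(), \c splitTokens() and the \c Row type are
hypothetical names introduced for illustration, and the sketch assumes that no
value contains whitespace. It skips empty lines and comment lines and rejects
duplicate values in the "id" map.

```cpp
#include <cstddef>
#include <istream>
#include <map>
#include <set>
#include <sstream>
#include <stdexcept>
#include <string>
#include <vector>

// One row of the section: map name -> textual value (hypothetical type).
typedef std::map<std::string, std::string> Row;

// Splits a line into whitespace-separated tokens.
std::vector<std::string> splitTokens(const std::string& line) {
  std::istringstream ls(line);
  std::vector<std::string> tokens;
  std::string tok;
  while (ls >> tok) tokens.push_back(tok);
  return tokens;
}

// Parses the lines that follow "@nodeset": first the map names, then one
// row per node. Throws if a row has the wrong number of values or if the
// "id" map contains a duplicate.
std::vector<Row> parseNodeSet(std::istream& is) {
  std::string line;
  std::vector<std::string> names;
  std::vector<Row> rows;
  std::set<std::string> seenIds;
  while (std::getline(is, line)) {
    std::vector<std::string> tokens = splitTokens(line);
    if (tokens.empty()) continue;        // empty line
    if (tokens[0][0] == '#') continue;   // comment line
    if (tokens[0][0] == '@') break;      // the next section starts here
    if (names.empty()) { names = tokens; continue; }
    if (tokens.size() != names.size())
      throw std::runtime_error("wrong number of values: " + line);
    Row row;
    for (std::size_t i = 0; i < names.size(); ++i) row[names[i]] = tokens[i];
    Row::const_iterator id = row.find("id");
    if (id != row.end() && !seenIds.insert(id->second).second)
      throw std::runtime_error("duplicate id: " + id->second);
    rows.push_back(row);
  }
  return rows;
}
```

Keeping every value as a string is a simplification; the real reader converts
each value to the value type of the map it is stored in.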

The edgeset section is very similar to the nodeset section: it has
the same column-oriented structure. It starts with the line

<tt>\@edgeset</tt>

The next line contains the whitespace-separated list of the names of the edge
maps. Each of the following lines describes one edge. The first two elements in
the line are the IDs of the source and target (or tail and head) nodes of the
edge as they occur in the ID node map of the nodeset section. You can also
have an optional ID map on the edges for later reference; its values must
be unique as well.

\code
@edgeset
             id    weight   label
3   5        a     4.3      a-edge
5   12       c     2.6      c-edge
3   12       g     3.4      g-edge
\endcode
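
The row layout of the edgeset section (two node IDs first, then one value per
named map) can be sketched in plain C++ as follows. This is only an
illustration, not LEMON code: \c EdgeRow and \c parseEdgeRow() are hypothetical
names, values are assumed to be free of whitespace, and the set of known node
IDs is assumed to come from a previously parsed nodeset section.

```cpp
#include <cstddef>
#include <map>
#include <set>
#include <sstream>
#include <stdexcept>
#include <string>
#include <vector>

// One parsed edge: the two endpoint IDs and the map values (hypothetical).
struct EdgeRow {
  std::string source;
  std::string target;
  std::map<std::string, std::string> values;
};

// Parses one data line of an edgeset section. mapNames holds the names from
// the header line; nodeIds holds the IDs declared in the nodeset section.
EdgeRow parseEdgeRow(const std::vector<std::string>& mapNames,
                     const std::set<std::string>& nodeIds,
                     const std::string& line) {
  std::istringstream ls(line);
  EdgeRow row;
  if (!(ls >> row.source >> row.target))
    throw std::runtime_error("missing source or target: " + line);
  if (!nodeIds.count(row.source) || !nodeIds.count(row.target))
    throw std::runtime_error("unknown node id in: " + line);
  for (std::size_t i = 0; i < mapNames.size(); ++i) {
    std::string value;
    if (!(ls >> value))
      throw std::runtime_error("missing value for map " + mapNames[i]);
    row.values[mapNames[i]] = value;
  }
  return row;
}
```

Note that the header line names only the maps, which is why the data lines
have two more columns than the header.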

The \e nodes section contains <em>labeled (distinguished) nodes</em>
(i.e. nodes having a special label on them). The section starts with

<tt> \@nodes </tt>

Each of the following lines contains a label for a node in the graph
followed by the node's ID as given in the \e nodeset section.

\code
@nodes
source 3
target 12
\endcode

The \e edges section describes the <em>labeled (distinguished) edges</em>
(i.e. edges having a special label on them). It starts with \c \@edges
and then each line contains the label of the edge followed by the edge's ID.

\code
@edges
observed c
\endcode

The file may contain empty lines and comment lines. Comment lines
start with a \c # character.

The attributes section holds information about the graph as a whole. Each
line contains a key-value pair: a key followed by the value mapped to it. The
key must be a string without whitespace; the value can be of various types.

\code
@attributes
title "Four colored plan graph"
author "Balazs DEZSO"
copyright "Lemon Library"
version 12
\endcode

The file ends with the

<tt> \@end </tt>

line.

\section use Using graph input-output

The easiest way to use graph input and output is through the overloaded
\ref readGraph() and \ref writeGraph() functions; if you do not need
very sophisticated behaviour, these may be all you need.
Otherwise, read on.

The graph input and output is based on <em>reading and writing
commands</em>. The user gives reading and writing commands to the reader or
writer class and then calls the \c run() method, which executes all the given
commands.
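
The command-based pattern can be illustrated without LEMON. The following is
a minimal standalone sketch in which commands are only recorded when they are
given and all output happens in \c run(). The class \c TableWriter and its
\c writeColumn() member are hypothetical names chosen for this illustration;
they are not part of the LEMON interface.

```cpp
#include <cstddef>
#include <functional>
#include <ostream>
#include <sstream>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for a writer class: it records commands and defers
// all output until run() is called.
class TableWriter {
public:
  typedef std::function<std::string(std::size_t)> ValueOf;

  TableWriter(std::ostream& os, std::size_t rowCount)
    : os_(os), rowCount_(rowCount) {}

  // Records a command: a column name plus a function giving the value of
  // row i. Nothing is written at this point.
  TableWriter& writeColumn(const std::string& name, ValueOf value) {
    columns_.push_back(std::make_pair(name, value));
    return *this;
  }

  // Executes the recorded commands: the header line first, then one line
  // per row.
  void run() {
    os_ << "@nodeset\n";
    for (std::size_t c = 0; c < columns_.size(); ++c)
      os_ << (c ? " " : "") << columns_[c].first;
    os_ << "\n";
    for (std::size_t r = 0; r < rowCount_; ++r) {
      for (std::size_t c = 0; c < columns_.size(); ++c)
        os_ << (c ? " " : "") << columns_[c].second(r);
      os_ << "\n";
    }
  }

private:
  std::ostream& os_;
  std::size_t rowCount_;
  std::vector<std::pair<std::string, ValueOf> > columns_;
};
```

Deferring the work to \c run() is what allows a column-oriented section to be
written row by row even though the columns are declared one at a time.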

\subsection write Writing a graph

The \c GraphWriter class provides the graph output. To write a graph
you should first give writing commands to the writer. The available
writing commands cover \c NodeMap and \c EdgeMap writing and labeled Node and
Edge writing.

\code
GraphWriter<ListGraph> writer(std::cout, graph);
\endcode

The \c writeNodeMap() function declares a \c NodeMap writing command in the
\c GraphWriter. You should give the name of the map and the map
object as parameters. The NodeMap writing command with the name "id" should
write a map with unique values because it will be regarded as an ID map.

\see IdMap, DescriptorMap

\code
IdMap<ListGraph, Node> nodeIdMap(graph);
writer.writeNodeMap("id", nodeIdMap);

writer.writeNodeMap("x-coord", xCoordMap);
writer.writeNodeMap("y-coord", yCoordMap);
writer.writeNodeMap("color", colorMap);
\endcode

With the \c writeEdgeMap() member function you can give an edge map
writing command in the same way as for node maps.

\see IdMap, DescriptorMap

\code
DescriptorMap<ListGraph, Edge, ListGraph::EdgeMap<int> > edgeDescMap(graph);
writer.writeEdgeMap("descriptor", edgeDescMap);

writer.writeEdgeMap("weight", weightMap);
writer.writeEdgeMap("label", labelMap);
\endcode

With the \c writeNode() and \c writeEdge() functions you can designate Nodes and
Edges in the graph. For example, you can write out the source and target node
of a maximum flow instance.

\code
writer.writeNode("source", sourceNode);
writer.writeNode("target", targetNode);

writer.writeEdge("observed", edge);
\endcode

With the \c writeAttribute() function you can write an attribute to the file.

\code
writer.writeAttribute("author", "Balazs DEZSO");
writer.writeAttribute("version", 12);
\endcode

After you have given all writing commands you must call the \c run() member
function, which executes them.

\code
writer.run();
\endcode

\subsection reading Reading a graph

The file to be read may contain several maps and labeled nodes or edges.
When you read a graph you need not read all the maps and items, only those
that you need. The interface of the \c GraphReader is very similar to
that of the \c GraphWriter, but the reading method does not depend on the order
of the given commands.

The reader object assumes that a value it does not read contains no
whitespace. It therefore offers extra ways to control how such a value is
skipped when its string representation does contain spaces.

\code
GraphReader<ListGraph> reader(std::cin, graph);
\endcode

The \c readNodeMap() function reads a map from the \c nodeset section.
If there is a map that you do not want to read from the file and there is
whitespace in the string representation of its values, then you should
call the \c skipNodeMap() template member function with the proper parameters.

\see QuotedStringReader

\code
reader.readNodeMap("x-coord", xCoordMap);
reader.readNodeMap("y-coord", yCoordMap);

reader.readNodeMap<QuotedStringReader>("label", labelMap);
reader.skipNodeMap<QuotedStringReader>("description");

reader.readNodeMap("color", colorMap);
\endcode

With the \c readEdgeMap() member function you can give an edge map
reading command in the same way as for node maps.

\code
reader.readEdgeMap("weight", weightMap);
reader.readEdgeMap("label", labelMap);
\endcode

With the \c readNode() and \c readEdge() functions you can read labeled Nodes
and Edges.

\code
reader.readNode("source", sourceNode);
reader.readNode("target", targetNode);

reader.readEdge("observed", edge);
\endcode

With the \c readAttribute() function you can read an attribute from the file.

\code
std::string author;
reader.readAttribute("author", author);
int version;
reader.readAttribute("version", version);
\endcode

After you have given all reading commands you must call the \c run() member
function, which executes them.

\code
reader.run();
\endcode

\anchor rwbackground
\section types Background of Reading and Writing

To read a map (on the nodes or edges)
the \c GraphReader must know how to read a Value of the given map.
By default, the input operator reads the value from
the stream, and the type of the value read is the value type of the given map.
When the reader has to skip a value in the stream because you do not
want to store it in a map, it skips a character sequence without
whitespace.

If you want to change the functionality of the reader, you can use
template parameters to specialize it. When you give a reading
command for a map you can give a Reader type as a template parameter.
With this template parameter you can control how the Reader reads
a value from the stream.

A Reader type has the following structure:
\code
struct TypeReader {
  // The type of the values this reader produces.
  typedef TypeName Value;

  // Reads one value from the stream into the given variable.
  void read(std::istream& is, Value& value);
};
\endcode

For example, suppose the \c "strings" node map contains strings and you need
only the length of each string, not its value. Then you can implement your own
Reader struct.

\code
struct LengthReader {
  typedef int Value;

  // Reads a whitespace-free string and stores only its length.
  void read(std::istream& is, Value& value) {
    std::string tmp;
    is >> tmp;
    value = tmp.length();
  }
};
...
reader.readNodeMap<LengthReader>("strings", lengthMap);
\endcode
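
The same structure can handle values that contain whitespace. The following
standalone sketch reads a double-quoted string into a \c std::string. It is
only an illustration of the Reader structure shown above: the name
\c SimpleQuotedReader is hypothetical, it is not the library's
\c QuotedStringReader, and it deliberately does not handle escape sequences.

```cpp
#include <istream>
#include <sstream>
#include <stdexcept>
#include <string>

// Hypothetical reader type following the Reader structure: a Value typedef
// and a read() member function.
struct SimpleQuotedReader {
  typedef std::string Value;

  // Reads a value of the form "text with spaces" and stores the text
  // between the quotes. Throws if a quote is missing.
  void read(std::istream& is, Value& value) {
    char c;
    // operator>> skips leading whitespace before the opening quote.
    if (!(is >> c) || c != '"')
      throw std::runtime_error("expected opening quote");
    value.clear();
    // get() does not skip whitespace, so spaces inside the quotes are kept.
    while (is.get(c)) {
      if (c == '"') return;
      value += c;
    }
    throw std::runtime_error("missing closing quote");
  }
};
```

Because the reader consumes exactly the quoted text, the stream is left
positioned at the next value of the line.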

The global functionality of the reader class can be changed by giving a
special template parameter to the \c GraphReader class. By default, the
template parameter is \c DefaultReaderTraits. A reader traits class
should provide an inner template class Reader for each type, and a
DefaultReader for skipping a value.

The specialization of writing is very similar to that of reading.
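
This page does not show the writer-side counterpart of the Reader structure.
Assuming that it mirrors the reader, i.e. a struct with a \c Value typedef and
a <tt>write(std::ostream&, const Value&)</tt> member (this signature is an
assumption, not something stated above), a writer that prints doubles with two
decimals could be sketched as follows. The name \c FixedPointWriter is
hypothetical.

```cpp
#include <iomanip>
#include <ios>
#include <ostream>
#include <sstream>

// Hypothetical writer type; the write() signature is assumed to mirror the
// read() member of the Reader structure.
struct FixedPointWriter {
  typedef double Value;

  // Writes the value with two digits after the decimal point and then
  // restores the previous formatting state of the stream.
  void write(std::ostream& os, const Value& value) {
    std::ios::fmtflags flags = os.flags();
    std::streamsize precision = os.precision();
    os << std::fixed << std::setprecision(2) << value;
    os.flags(flags);
    os.precision(precision);
  }
};
```

Restoring the stream state keeps the formatting choice local to this one map,
so other values written to the same stream are not affected.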

\section undir Undirected graphs

In a file describing an undirected graph (undir graph, for short) you find an
\c undiredgeset section instead of the \c edgeset section. The first line of
the section gives the names of the maps on the undirected edges, and each of
the following lines describes one undirected edge with its incident nodes and
the values of the maps.

The format handles directed edge maps as syntactic sugar: if there
are two maps whose names are the same apart from a \c '+' and a \c '-' prefix,
then they are read together as one directed map.

\code
@undiredgeset
             id    capacity +flow -flow
32   2       1     4.3      2.0   0.0
21   21      5     2.6      0.0   2.6
21   12      8     3.4      0.0   0.0
\endcode
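
The pairing rule for the \c '+' and \c '-' prefixes can be sketched in plain
C++. The function below takes the map names from the header line and returns
the names that occur with both prefixes, i.e. the candidates for directed
maps. \c directedMapNames() is a hypothetical helper written for this
illustration; it says nothing about how the library implements the rule.

```cpp
#include <cstddef>
#include <set>
#include <string>
#include <vector>

// Returns, in sorted order, every name that appears both as "+name" and as
// "-name" among the given column names.
std::vector<std::string> directedMapNames(
    const std::vector<std::string>& columns) {
  std::set<std::string> plus, minus;
  for (std::size_t i = 0; i < columns.size(); ++i) {
    const std::string& name = columns[i];
    if (name.size() < 2) continue;  // a bare "+" or "-" names nothing
    if (name[0] == '+') plus.insert(name.substr(1));
    else if (name[0] == '-') minus.insert(name.substr(1));
  }
  std::vector<std::string> result;
  for (std::set<std::string>::const_iterator it = plus.begin();
       it != plus.end(); ++it) {
    if (minus.count(*it)) result.push_back(*it);
  }
  return result;
}
```

For the header line of the example above, only \c flow is reported; \c id and
\c capacity remain ordinary undirected maps.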

The \c edges section is replaced by the \c undiredges section. This section
describes labeled edges and undirected edges. The label of a directed edge
should start with a \c '+' or a \c '-' prefix to select the direction
of the edge.

\code
@undiredges
undiredge 1
+edge 5
-back 5
\endcode

There are classes similar to \c GraphReader and \c GraphWriter
which handle undirected graphs. These classes are
\c UndirGraphReader and \c UndirGraphWriter.

The \c readUndirEdgeMap() function reads an undirected map and the
\c readUndirEdge() function reads an undirected edge from the file.

\code
reader.readUndirEdgeMap("capacity", capacityMap);
reader.readEdgeMap("flow", flowMap);
...
reader.readUndirEdge("undir_edge", undir_edge);
reader.readEdge("edge", edge);
\endcode

\section advanced Advanced features

The graph reader and writer classes give an easy way to read and write
graphs. But sometimes we want more advanced features. In this case we can
use the more general <tt>lemon reader and writer</tt> interface.

The LEMON file format is a section-oriented file format. It contains one or
more sections, each starting with a line identifying its type
(a word starting with the \c \@ character).
Consequently, the content of a section cannot contain a line whose first
character is \c \@. The file may contain comment lines whose first character
is \c #.

The \c LemonReader and \c LemonWriter give a framework to read and
write sections. There are various section reader and section writer
classes which can be attached to a \c LemonReader or a \c LemonWriter.

There are default section readers and writers for reading and writing
item sets and labeled items in the graph. These read and write
the format described above. Other types of data can be handled with your own
section reader and writer classes which are derived from the
\c LemonReader::SectionReader or the \c LemonWriter::SectionWriter classes.

The next example defines a special section reader which reads the
\c \@description sections into a string:

\code
class DescriptionReader : public LemonReader::SectionReader {
protected:
  // Accepts the section if its first line starts with "@description".
  virtual bool header(const std::string& line) {
    std::istringstream ls(line);
    std::string head;
    ls >> head;
    return head == "@description";
  }

  // Appends every line of the section to the stored description.
  virtual void read(std::istream& is) {
    std::string line;
    while (std::getline(is, line)) {
      desc += line;
    }
  }
public:

  typedef LemonReader::SectionReader Parent;

  DescriptionReader(LemonReader& reader) : Parent(reader) {}

  const std::string& description() const {
    return desc;
  }

private:
  std::string desc;
};
\endcode
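
How a framework might drive such section readers can be sketched without
LEMON. In the standalone code below, the input is cut into sections at the
lines starting with \c \@, and each section body is handed to the first
handler whose \c header() accepts the section's first line. The names
\c SectionHandler, \c DescriptionHandler and \c dispatchSections() are
hypothetical; the sketch only mirrors the two hooks used in the example above
and is not the \c LemonReader implementation.

```cpp
#include <cstddef>
#include <istream>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical interface with the same two hooks as the example above.
struct SectionHandler {
  virtual ~SectionHandler() {}
  virtual bool header(const std::string& line) = 0;
  virtual void read(std::istream& is) = 0;
};

// Collects the lines of every "@description" section, one per line.
struct DescriptionHandler : SectionHandler {
  std::string desc;
  bool header(const std::string& line) {
    std::istringstream ls(line);
    std::string head;
    ls >> head;
    return head == "@description";
  }
  void read(std::istream& is) {
    std::string line;
    while (std::getline(is, line)) desc += line + "\n";
  }
};

// Passes one finished section to the first handler that accepts its header.
static void flushSection(const std::string& header, const std::string& body,
                         const std::vector<SectionHandler*>& handlers) {
  for (std::size_t i = 0; i < handlers.size(); ++i) {
    if (handlers[i]->header(header)) {
      std::istringstream bodyStream(body);
      handlers[i]->read(bodyStream);
      return;
    }
  }
}

// Cuts the input into sections; comment lines are dropped, and sections
// without a matching handler are ignored.
void dispatchSections(std::istream& is,
                      const std::vector<SectionHandler*>& handlers) {
  std::string line, header, body;
  bool inSection = false;
  while (std::getline(is, line)) {
    if (!line.empty() && line[0] == '#') continue;  // comment line
    if (!line.empty() && line[0] == '@') {          // a new section starts
      if (inSection) flushSection(header, body, handlers);
      header = line;
      body.clear();
      inSection = true;
      continue;
    }
    if (inSection) body += line + "\n";
  }
  if (inSection) flushSection(header, body, handlers);
}
```

Giving each handler a stream that contains only its own section is what lets
\c read() loop until end of input without looking for the next \c \@ line.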

Another advanced feature of the generalized file format is that
multiple edgesets can be stored on the same nodeset. This can be used,
for example, to store a network traffic matrix.

In our example there is a network with symmetric links and there are asymmetric
traffic requests on the network. This construction can be stored in an
undirected graph and in a directed \c NewEdgeSetAdaptor class. The example
shows the input with the \c LemonReader class:

\code
UndirListGraph network;
UndirListGraph::UndirEdgeMap<double> capacity(network);
NewEdgeSetAdaptor<UndirListGraph> traffic(network);
NewEdgeSetAdaptor<UndirListGraph>::EdgeMap<double> request(traffic);

LemonReader reader(std::cin);
NodeSetReader<UndirListGraph> nodesetReader(reader, network);
UndirEdgeSetReader<UndirListGraph> undirEdgesetReader(reader, network, nodesetReader);
undirEdgesetReader.readEdgeMap("capacity", capacity);
EdgeSetReader<NewEdgeSetAdaptor<UndirListGraph> > edgesetReader(reader, traffic, nodesetReader);
edgesetReader.readEdgeMap("request", request);

reader.run();
\endcode

Because both \c GraphReader and \c UndirGraphReader can be converted
to \c LemonReader and can resolve the IDs of the items, the previous
result can be achieved with the \c UndirGraphReader class, too.

\code
UndirListGraph network;
UndirListGraph::UndirEdgeMap<double> capacity(network);
NewEdgeSetAdaptor<UndirListGraph> traffic(network);
NewEdgeSetAdaptor<UndirListGraph>::EdgeMap<double> request(traffic);

UndirGraphReader<UndirListGraph> reader(std::cin, network);
reader.readEdgeMap("capacity", capacity);
EdgeSetReader<NewEdgeSetAdaptor<UndirListGraph> > edgesetReader(reader, traffic, reader);
edgesetReader.readEdgeMap("request", request);

reader.run();
\endcode
\author Balazs Dezso
*/
}