Symbolic Link (SYLK)
| Symbolic Link (SYLK) | |
|---|---|
| Filename extension | .slk |
| Internet media type | application/x-sylk |
| Developed by | Microsoft |
Symbolic Link (SYLK) is a Microsoft file format typically used to exchange data between applications, specifically spreadsheets. SYLK files conventionally have a .slk suffix. Composed of only displayable ANSI characters, it can be easily created and processed by other applications, such as databases.
Microsoft has never published a SYLK specification. Variants of the format are supported by Multiplan, Microsoft Excel, Microsoft Works, OpenOffice.org, LibreOffice[1] and Gnumeric. The format was introduced in the 1980s and has not evolved since 1986.[2]
A commonly encountered (and spurious) 'occurrence' of the SYLK format happens when a comma-separated values (CSV) file is saved with an unquoted first field name of 'ID', so that the first two characters of the file match the first two characters of a SYLK file. Microsoft Excel (at least up to Office 2016) will then emit misleading error messages relating to the format of the file, such as "The file you are trying to open, 'x.csv', is in a different format than specified by the file extension...".[3][4]
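The false detection described above can be illustrated with a short Python sketch. The function names are hypothetical, and the workaround shown (quoting the first field name) is inferred only from the statement that the problem arises with an unquoted 'ID'; it is not a documented Excel behaviour.

```python
def starts_like_sylk(text: str) -> bool:
    """Return True if the text begins with the two characters 'ID'."""
    return text.startswith("ID")


def safe_csv_header(columns: list[str]) -> str:
    """Build a CSV header line that does not begin with the characters 'ID'.

    Assumption: quoting the first field name is enough to avoid the check.
    """
    header = ",".join(columns)
    if starts_like_sylk(header):
        # Wrap only the first column name in double quotes.
        columns = ['"' + columns[0] + '"'] + columns[1:]
        header = ",".join(columns)
    return header


print(safe_csv_header(["ID", "Name"]))   # first field gets quoted
print(safe_csv_header(["Name", "ID"]))   # left unchanged
```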
SYLK is known to cause security issues, as it allows an attacker to run arbitrary code, offers the opportunity to disguise the attack vector under the benign-looking appearance of a CSV file, and is still enabled by default on recent (2016) versions of Microsoft Excel.[2]
Limitations
SYLK does not have support for Unicode. Even if a SYLK file is created by an application that supports Unicode (for example Microsoft Excel), the SYLK file will be encoded in the current system's ANSI code page, not in Unicode. If the application contained characters that were displayable in Unicode but have no code point in the current system's code page, they will be converted to question marks ('?') in the SYLK file.
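The replacement of unrepresentable characters can be reproduced with Python's standard codecs. This is only a sketch: `cp1252` stands in for "the current system's ANSI code page", which in practice depends on the machine, and `to_ansi` is a hypothetical helper name.

```python
def to_ansi(text: str, code_page: str = "cp1252") -> bytes:
    """Encode text for a SYLK file, turning unencodable characters into '?'."""
    # errors="replace" substitutes b"?" for every character that has no
    # code point in the chosen code page.
    return text.encode(code_page, errors="replace")


print(to_ansi("Café 日本"))
```

With `cp1252`, the accented "é" survives as a single byte, while the two Japanese characters each become a question mark.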
The semicolon is treated as a field separator in SYLK, so cannot be used unescaped in data values. If a character string in the SYLK file is to contain a semicolon (;) then it should be prefixed with another semicolon so the string would appear as e.g., "WIDGET;;AXC1254". MS Excel will strip the first semicolon on import and the data element will appear as "WIDGET;AXC1254".
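A minimal sketch of this escaping rule follows. The helper names are illustrative, and the splitter assumes that a doubled semicolon is always an escaped literal and never an empty field.

```python
def escape_field(value: str) -> str:
    """Double every semicolon so it is not read as a field separator."""
    return value.replace(";", ";;")


def split_fields(record: str) -> list[str]:
    """Split one record on single semicolons, collapsing ';;' to ';'."""
    fields, current, i = [], [], 0
    while i < len(record):
        if record[i] == ";" and record[i + 1 : i + 2] == ";":
            current.append(";")          # escaped semicolon inside a value
            i += 2
        elif record[i] == ";":
            fields.append("".join(current))  # real field separator
            current = []
            i += 1
        else:
            current.append(record[i])
            i += 1
    fields.append("".join(current))
    return fields


record = 'C;Y1;X1;K"' + escape_field("WIDGET;AXC1254") + '"'
print(record)
print(split_fields(record))
```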
Each line of a SYLK input file must be no longer than 260 characters. Otherwise, Microsoft Excel will issue an error message and skip loading the overlong line.
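A writer can guard against this limit before a file ever reaches Excel. The sketch below is a hypothetical minimal writer, not a complete implementation: it emits only ID, C, and E records, assumes string values contain no double quotes, and treats the 260-character figure as applying to each record line excluding the line terminator.

```python
MAX_LINE_LENGTH = 260  # per-line limit reported for Microsoft Excel


def write_sylk(rows):
    """Serialize a list of rows (lists of str or numbers) to SYLK text."""
    lines = ["ID;P"]
    for y, row in enumerate(rows, start=1):
        for x, value in enumerate(row, start=1):
            if isinstance(value, str):
                # Strings are quoted; semicolons are doubled.
                k = '"' + value.replace(";", ";;") + '"'
            else:
                k = str(value)
            lines.append(f"C;Y{y};X{x};K{k}")
    lines.append("E")
    for line in lines:
        if len(line) > MAX_LINE_LENGTH:
            raise ValueError(f"record exceeds {MAX_LINE_LENGTH} characters")
    return "\n".join(lines) + "\n"


print(write_sylk([["Row 1", 11], ["Row 2", 22]]))
```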
Description
Below are the contents of a sample SYLK file:
ID;P
C;Y1;X1;K"Row 1"
C;Y2;X1;K"Row 2"
C;Y3;X1;K"Total"
C;Y1;X2;K11
C;Y2;X2;K22
C;Y3;X2;K0;ER1C2+R2C2
E
This file, when read by a spreadsheet application that supports SYLK, would be displayed as follows:
| Row 1 | 11 |
| Row 2 | 22 |
| Total | 33 |
SYLK files consist of a sequence of records with semicolon-delimited lists of fields. In this file, the ID record (specifically the ID; at the start of the file) acts as a magic number, identifying the file as a SYLK file, as all SYLK files must start with an ID record. The P field without a value indicates the file was not generated by Microsoft Excel or Microsoft's older spreadsheet application Multiplan. Each C record describes one cell in the spreadsheet located at cell (X,Y) (the values of the X and Y fields). The K fields define the raw value of each cell. Strings of text are enclosed in quotation marks. Otherwise, the value is listed without quotes. Dates and times are internally stored as numbers of whole and fractional days since a set epoch and displayed accordingly with custom cell formatting options. The E field defines the value of the cell to be the value of an expression (equivalent, here, to the more familiar formula =B1+B2). Lastly, the E record marks the end of the file. It must be the last record in the file.
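The reading process described above can be sketched in Python. This is an illustrative reader only: it splits fields on every semicolon (so it assumes no escaped semicolons in the data), ignores all record types other than ID, C, and E, and evaluates only expressions that are sums of absolute R1C1 references, which is all the sample needs.

```python
import re

SAMPLE = """\
ID;P
C;Y1;X1;K"Row 1"
C;Y2;X1;K"Row 2"
C;Y3;X1;K"Total"
C;Y1;X2;K11
C;Y2;X2;K22
C;Y3;X2;K0;ER1C2+R2C2
E
"""


def parse_cells(text):
    """Return {(row, col): (value, expression)} for every C record."""
    lines = text.splitlines()
    if not lines or not lines[0].startswith("ID"):
        raise ValueError("not a SYLK file: the first record must be ID")
    cells, row = {}, None
    for line in lines[1:]:
        fields = line.split(";")
        if fields[0] == "E":       # E record: end of file
            break
        if fields[0] != "C":       # other record types are skipped here
            continue
        col, value, expr = None, None, None
        for field in fields[1:]:
            key, arg = field[:1], field[1:]
            if key == "Y":
                row = int(arg)
            elif key == "X":
                col = int(arg)
            elif key == "K":       # quoted text or a bare number
                value = arg[1:-1] if arg.startswith('"') else float(arg)
            elif key == "E":
                expr = arg
        cells[(row, col)] = (value, expr)
    return cells


def evaluate_sum(cells, expr):
    """Evaluate an expression of the form RnCm+RnCm+... (sums only)."""
    if not re.fullmatch(r"R\d+C\d+(\+R\d+C\d+)*", expr):
        raise ValueError(f"unsupported expression: {expr}")
    refs = re.findall(r"R(\d+)C(\d+)", expr)
    return sum(cells[(int(r), int(c))][0] for r, c in refs)


cells = parse_cells(SAMPLE)
print(evaluate_sum(cells, cells[(3, 2)][1]))
```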
Formatting
The F record is used to define and apply styles to cells, and P records can be used to define shared number formats, along with font information. For example:
ID;P
P;PGeneral
P;P_(* #,##0_);;_(* \-#,##0_);;_(* "-"_);;_(@_)
P;P_(* #,##0.00_);;_(* \(#,##0.00\);;_(* "-"??_);;_(@_)
P;P_("$"* #,##0.00_);;_("$"* \(#,##0.00\);;_("$"* "-"??_);;_(@_)
C;Y1;X1;K"Row 1"
C;Y2;X1;K"Row 2"
C;Y3;X1;K"Total"
C;Y1;X2;K11
C;Y2;X2;K22
C;Y3;X2;K0;ER1C2+R2C2
F;P2;C2
E
is rendered similarly to the following:
| Row 1 | 11.00 |
| Row 2 | 22.00 |
| Total | 33.00 |
The F record's P2 field indicates that the number format in the third and penultimate P record is being used, and the record's C2 field applies that format to the second column. Indices of P records are zero-indexed, while columns and rows are one-indexed. In this example, the following number formats are defined:
- P0, the general number format, which displays the number as it is listed in the file,
- P1, a number format that uses thousands separators and no decimal places,
- P2 (applied to column two), the same number format as P1 but with two decimal places, and
- P3, the same number format as P2 but with a dollar sign at the start.
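The zero-based lookup from an F record's P field into the list of P records can be sketched as follows. The function names are hypothetical, and the sketch counts only P records that carry a P (number format) field, which matches the example but is an assumption about files that also contain font-only P records.

```python
EXAMPLE = [
    "ID;P",
    "P;PGeneral",
    r'P;P_(* #,##0_);;_(* \-#,##0_);;_(* "-"_);;_(@_)',
    r'P;P_(* #,##0.00_);;_(* \(#,##0.00\);;_(* "-"??_);;_(@_)',
    r'P;P_("$"* #,##0.00_);;_("$"* \(#,##0.00\);;_("$"* "-"??_);;_(@_)',
    "F;P2;C2",
    "E",
]


def number_formats(records):
    """Collect number formats from P records, in file order (index 0 first)."""
    formats = []
    for record in records:
        if record.startswith("P;P"):
            # Undo the ';;' escaping used for literal semicolons.
            formats.append(record[3:].replace(";;", ";"))
    return formats


def selected_format(f_record, formats):
    """Return the number format an F record selects through its P field."""
    for field in f_record.split(";")[1:]:
        if field.startswith("P"):
            return formats[int(field[1:])]
    return None


formats = number_formats(EXAMPLE)
print(selected_format("F;P2;C2", formats))
```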
The W field of the F record defines the widths of a group of columns in characters. For instance, the records F;W1 1 20 and F;W2 2 30 set the widths of the first two columns of a spreadsheet to 20 and 30 characters wide respectively. Likewise, F;W1 5 5 would set the widths of the first five columns to be five characters wide each.
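A small sketch of how a W field expands into per-column widths, using the examples from the paragraph above. `parse_width_field` is an illustrative name, and the field is assumed to hold exactly three space-separated integers.

```python
def parse_width_field(field: str) -> dict[int, int]:
    """Expand a W field such as 'W1 5 5' into {column: width}."""
    if not field.startswith("W"):
        raise ValueError("not a W field")
    first, last, width = (int(part) for part in field[1:].split())
    # Both column numbers are one-indexed and inclusive.
    return {col: width for col in range(first, last + 1)}


print(parse_width_field("W1 1 20"))
print(parse_width_field("W1 5 5"))
```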
The F field of the F record defines both the number format and text alignment of a cell, column, row, or spreadsheet. Unlike the P record, this field only supports a basic set of number formats. As an example, an F;FD3C record would use the default number format with three digits and center-aligned text.
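The three-part structure of the F field (number format letter, digit count, alignment letter) lends itself to a small decoder. The sketch below uses the letter codes listed for the F field in the Syntax section of this article; the descriptive strings and the function name are illustrative choices.

```python
import re

NUMBER_FORMATS = {
    "C": "continuous cross-cell display", "D": "default", "E": "scientific",
    "F": "fixed", "G": "general", "$": "dollar", "*": "bar graph",
    "%": "percentage",
}
ALIGNMENTS = {
    "C": "center", "D": "default", "G": "general", "L": "left",
    "R": "right", "X": "fill", "-": "ignored",
}
# F, one number-format character, one or more digits, one alignment character.
_FORMAT = re.compile(r"F([CDEFG$*%])(\d+)([CDGLRX-])")


def parse_format_field(field: str):
    """Decode a field such as 'FD3C' into (number format, digits, alignment)."""
    match = _FORMAT.fullmatch(field)
    if match is None:
        raise ValueError(f"not a valid F field: {field}")
    ch1, digits, ch2 = match.groups()
    return NUMBER_FORMATS[ch1], int(digits), ALIGNMENTS[ch2]


print(parse_format_field("FD3C"))
print(parse_format_field("F$2C"))
```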
The following SYLK file:
ID;P
P;PGeneral
C;Y1;X1;K"Row 1 Left Justify"
F;P0;FG0L
C;Y2;X1;K"Row 2 Right Justify"
F;P0;FG0R
C;Y3;X1;K"Total at Center"
F;P0;FG0C
C;Y1;X2;K11
C;Y2;X2;K22
C;Y3;X2;K0;ER1C2+R2C2
F;Y1;X2;FF2L
F;Y2;X2;FF2R
F;Y3;X2;F$2C
F;W1 2 25
E
would be rendered, ignoring the incorrect column widths, like:
| Row 1 Left Justify | 11.00 |
| Row 2 Right Justify | 22.00 |
| Total at Center | $33.00 |
The P record can also be used to store font information, in addition to the number format. A cell with a P;FCambria;M240;SB;L4 record applied would be rendered using the Cambria font (FCambria), 12 point font (M240: the value of the M field divided by 20 is the font size in points), boldface (SB), and green (L4).[5]
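The font fields in that example can be decoded with a few lines of Python. This is a sketch with hypothetical names: it keeps the style string and colour index as raw values rather than assuming a full mapping, and it assumes the font name contains no semicolons.

```python
def parse_font_record(record: str) -> dict:
    """Decode the font-related fields of a P record."""
    fields = record.split(";")
    if fields[0] != "P":
        raise ValueError("not a P record")
    font = {}
    for field in fields[1:]:
        key, arg = field[:1], field[1:]
        if key == "F":
            font["name"] = arg
        elif key == "M":
            font["size_pt"] = int(arg) / 20   # M is in 1/20 point units
        elif key == "S":
            font["style"] = arg               # e.g. "B" for boldface
        elif key == "L":
            font["color_index"] = int(arg)
    return font


print(parse_font_record("P;FCambria;M240;SB;L4"))
```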
Syntax
The basic syntax of a SYLK file is shown below (in the ABNF syntax):
SYLK-file = 1*(Record)
Record = RecordType *(";" Field) LF
A list of valid record types along with their valid fields is shown below:
B record: A record used to tell the number of rows and columns in a spreadsheet. It is recommended that it comes before the C and F records. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| X | columns | The number of columns in the spreadsheet | Mandatory |
| Y | rows | The number of rows in the spreadsheet | Mandatory |
C record: A record used to describe a cell's contents.
| Field | Name | Description | Use |
|---|---|---|---|
| C | column reference | References another cell by its column number. | Optional |
| D | | Defines a shared expression. If this field is present, the E field must also be present. | Optional |
| E | expression | An expression or formula whose value is the value of the cell | Optional |
| G | | Defines a shared value. If this field is present, the E field must be absent and the K field must be present. | Optional |
| H | hidden | If present, the cell is hidden. | Optional |
| I | inside matrix? | If present, the cell is inside a matrix, and the K and E fields must be absent. | Optional |
| K | value | A field whose value is the value of the cell | Optional |
| M | matrix expression | A matrix expression from cell (X,Y) to cell (C,R). If this field is present, the E field must be absent. | Optional |
| N | not protected? | If present, or the N field is present in the ID record, the cell is not protected. If the ID record does not contain the N field, this field must be absent. | Optional |
| P | protected? | If present, or if the N field is not present in the ID record, the cell is protected. If the ID record contains the N field, this field must be absent. | Optional |
| R | row reference | References another cell by its row number. | Optional |
| S | shared reference | References a shared value (G field) or expression (D field). If present, the D, E, G, K, and M fields must all be absent and the R and C fields must be present, as the R and C fields define the row and column the shared value or expression is copied from. | Optional |
| X | column position | One-indexed column number of the cell | Mandatory |
| Y | row position | One-indexed row number of the cell. If this field is omitted, the last defined Y field's value is used instead. | Optional |
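Several of the C record constraints listed above are mechanical and can be checked in code. The sketch below covers only the presence and absence rules from the table (not the protection rules, which depend on the ID record); the function name and messages are illustrative, and fields are split on every semicolon, so values containing escaped semicolons are out of scope.

```python
def check_c_record(record: str) -> list[str]:
    """Return a list of rule violations for one C record (empty if none)."""
    keys = {field[:1] for field in record.split(";")[1:] if field}
    problems = []
    if "X" not in keys:
        problems.append("X is mandatory")
    if "D" in keys and "E" not in keys:
        problems.append("D requires E")
    if "G" in keys and ("E" in keys or "K" not in keys):
        problems.append("G requires K and forbids E")
    if "I" in keys and keys & {"K", "E"}:
        problems.append("I forbids K and E")
    if "M" in keys and "E" in keys:
        problems.append("M forbids E")
    if "S" in keys:
        if keys & {"D", "E", "G", "K", "M"}:
            problems.append("S forbids D, E, G, K, and M")
        if not {"R", "C"} <= keys:
            problems.append("S requires R and C")
    return problems


print(check_c_record("C;Y3;X2;K0;ER1C2+R2C2"))
print(check_c_record("C;Y1;K5"))
```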
E record: A record that marks the end of the file. It must be the last record in the file.
F record: A record that follows any P records and applies the formats described by those records (or fields described by the F record itself) to the entire spreadsheet or a single cell, column, or row. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| C | column | One-indexed column number. If present, the E, G, H, K, N, R, W, X, and Y fields must all be absent and one or more of the following fields must also be present: P, F, or S. | Optional |
| D | default format | Describes a default format. The form is near-identical to the F field below. The only difference is that the D specifiers for the number format and cell alignment are both invalid in this field. If this field is present, the C, R, X, and Y fields must all be absent. | Optional |
| E | show formula? | If present and the cell's value is defined by an expression or formula, that expression or formula is displayed instead of its value. | Optional |
| F | format | The cell, row, or column format, taking the form of ⟨ch1⟩⟨digits⟩⟨ch2⟩. ⟨ch1⟩, the number format (and ⟨digits⟩ is the number of decimal digits after the decimal separator), is one of the following: C, the continuous cross-cell display format; D, the default number format; E, the scientific notation or exponential format; F, the fixed decimal point format; G, the general number format; $, the dollar format, which precedes the number with a dollar sign ($) and uses two decimal places; *, the bar graph format, with one asterisk for every unit (e.g. 5 would be *****); or %, the percentage format. ⟨ch2⟩ is the text alignment, and can be one of: C, center alignment; D, the default alignment; G, general alignment, which is left alignment for text and right alignment for numbers; L, left alignment; R, right alignment; X, fill; and -, ignored. | Optional |
| G | show grid lines? | If absent in the entire file, the default grid lines are shown. | Optional |
| H | show headers? | If absent in the entire file, column and row headers are shown. | Optional |
| K | show commas? | If present and the cell has a numeric value, group separators (commas) are shown. | Optional |
| N | font | The font of the cell, column, or row in the form: ⟨fontId⟩ ⟨size⟩. | Optional |
| P | indexed number format | The indexed Excel-style cell number format, a zero-indexed number corresponding to which P record should be used. For instance, P3 selects the number format defined by the fourth P record. | Optional |
| R | row | One-indexed row number. If present, the C, E, G, H, K, N, W, X, and Y fields must all be absent and one or more of the following fields must also be present: P, F, or S. | Optional |
| S | style | A list of characters describing the text formatting, cell borders, and backgrounds of the cell, row, column, or spreadsheet. The following characters are used: B, for bottom grid lines; D, for boldface; I, for italic type; L, for left grid lines; R, for right grid lines; S, for shaded backgrounds; and T, for top grid lines. | Optional |
| W | column width | Sets the widths of a range of columns in the form ⟨col1⟩ ⟨col2⟩ ⟨width⟩, where ⟨col1⟩ and ⟨col2⟩ are one-indexed column numbers and ⟨width⟩ is the width of the columns in characters | Optional |
| X | cell column | One-indexed cell column number. If present, the Y field must be present, the C, E, G, H, K, N, R, and W fields must all be absent, and one or more of the following fields must also be present: P, F, or S. | Optional |
| Y | cell row | One-indexed cell row number. If present, the X field must be present. | Optional |
- At least one of the following fields must be present: C, D, E, G, H, K, P, R, W, X, or Y.
- The X and Y fields define cell formats, the R field defines default row formats, the C field defines default column formats, and the D field sets the default spreadsheet format.
ID record: A header record used to identify the type and creator of a spreadsheet. It must be the first record in the file. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| E | | If present, NE records are considered redundant. | Optional |
| N | | If present, the file uses N-field-style cell protection, otherwise it uses P-field-style cell protection. | Optional |
| P | program | The creator of the file. Can include: CL, for Microsoft Excel; MP, for Multiplan (Microsoft); or no value. | Mandatory |
NE record: A record that links an area on the spreadsheet to another area on an external spreadsheet. Must be placed after any NU records. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| E | destination | An expression that defines the target area on the spreadsheet to link the external data | Mandatory |
| F | file | The name of the file containing the external data | Mandatory |
| S | source | An expression that defines the source area on the external spreadsheet whose data is being linked | Mandatory |
NL record: A record that describes chart external links.
NN record: A record used to define named functions, macros, and expressions. It is more efficient if an NN record is defined prior to when its name is first used. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| E | expression | An expression describing the value of the named function, macro, or expression. | Mandatory |
| F | | If present, the named expression can be used as a function. | Optional |
| G | | A field whose value is of the form ⟨ch⟩ ⟨ch2⟩ that indicates a runnable, named macro with command key alias. If this field is present, the K field must be absent. | Optional |
| K | | A field whose value is of the form ⟨ch⟩ ⟨ch2⟩ that indicates an ordinary, named expression with unused command aliases. If this field is present, the G field must be absent. | Optional |
| N | name | The name of the function, macro, or expression. | Mandatory |
NU record: A record that precedes any NE records and describes file name substitutions. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| F | new file | The new file name | Mandatory |
| L | old file | The old file name | Mandatory |
O record: A record that describes spreadsheet options. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| C | | A completion test at the nearest preceding C record. | Optional |
| E | | A macro sheet. This should appear before the first appearance of an F or G field in an NN record as well as before the first C record that uses a macro-only function. | Optional |
| G | | If present, allows value iteration and circular references. Takes the form ⟨iter⟩ ⟨delta⟩, where ⟨iter⟩ is the maximum number of iterations and ⟨delta⟩ is a step test that, if smaller, is present. | Optional |
| L | | If present, cell references use the A1-style. In spite of this, even if present, the SYLK file itself still internally uses R1C1-style references in expressions. | Optional |
| M | | If present, cells are manually recalculated. Otherwise, they are automatically recalculated. | Optional |
| P | sheet protected? | If present, the sheet is protected. | Optional |
| V | base date value | Indicates the date system used for calculating serial date values. This is either 0 for the 1900 date system or 4 for the 1904 date system.[6] See below, after the end of the list of records, for more details. | Optional |
P record[5]: A record that describes cell formatting and precedes any F records. Supported fields:
| Field | Name | Description | Use |
|---|---|---|---|
| E | extended font definition | An extended font definition | Optional |
| F | font definition | A normal font definition | Optional |
| L | font color | The color of the font. | Optional |
| M | font size | The font size of the format in 1⁄20 point units | Optional |
| P | formatting | An Excel-style cell format specification | Mandatory |
| S | font style | The style of the font | Optional |
W record: A record that describes window definitions.
Date and time values are stored as floating-point numbers. The whole number part is the number of days after either 1 January 1900 (if the O record has a V field with value 0, indicating the 1900 date system is being used) or 1 January 1904 (if instead this value is 4, indicating the 1904 date system is in use). The 1904 date system is only used in older versions of Microsoft Excel on Macintosh systems prior to 2011.[6] The decimal component is the number of seconds that have elapsed since 00:00 (12:00 AM), divided by 86,400 (the number of seconds in a day).
It is possible to convert serial date values to Unix time by subtracting the serial value of 1 January 1970 (the Unix epoch), which is 25,569 in the 1900 date system or 24,107 in the 1904 date system (the two systems differ by 1,462 days), and then multiplying by 86,400. Conversion from Unix time to SYLK time can be done by doing the reverse (dividing by 86,400 and adding 25,569 or 24,107). However, to be displayed properly, the formatting of the cell must be set to a custom format such as P;Pdd/mm/yyyy\ hh:mm:ss.
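The conversion can be written out as a short Python sketch. The function and constant names are illustrative. The 1904 offset used here is derived as 25,569 − 1,462 = 24,107, from the 1,462-day gap between Excel's two date systems; treat that derivation as an assumption of this example. Time zones are ignored, so the serial value is taken to be UTC.

```python
from datetime import datetime, timezone

SECONDS_PER_DAY = 86_400
OFFSET_1900 = 25_569                 # serial value of 1 January 1970, 1900 system
OFFSET_1904 = OFFSET_1900 - 1_462    # same date in the 1904 system (assumed gap)


def serial_to_unix(serial: float, offset: int = OFFSET_1900) -> float:
    """Convert a serial date value to seconds since the Unix epoch."""
    return (serial - offset) * SECONDS_PER_DAY


def unix_to_serial(unix_seconds: float, offset: int = OFFSET_1900) -> float:
    """Convert seconds since the Unix epoch to a serial date value."""
    return unix_seconds / SECONDS_PER_DAY + offset


# Noon on the Unix epoch day is half a day past serial 25,569.
print(serial_to_unix(25_569.5))
print(datetime.fromtimestamp(serial_to_unix(45_000), tz=timezone.utc))
```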
Microsoft Excel limits cells in imported SYLK files to 255 characters. LibreOffice does not have this limitation.
External links
Syntax for SYLK can be found at:
- Microsoft SYLK summary (see also the Abusing the SYLK file format article for some useful information)
- GFF Format Summary: Microsoft SYLK
References
- [1] "LibreOffice Calc - Supported File Formats". 12 November 2016.
- [2] Hegt, Stan (30 October 2019). "Abusing the SYLK file format". outflank.nl. Retrieved 14 December 2019.
- [3] ""SYLK: File format is not valid" error message when you open file". Retrieved 18 August 2015.
- [4] "Excel CSV import returns an SYLK file format error". 9 January 2012. Retrieved 18 August 2015.
- [5] "Excel: SYLK File Format Additional P Record Fields". The Microsoft KnowledgeBase Archive. 2 November 1997. Retrieved 10 March 2025.
- [6] French, Ted (8 July 2019). "Serial Number and Serial Date in Excel". Lifewire. Retrieved 9 March 2025.