18.02.2014 Views

string

string

string

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Solaris Common Messages and<br />

Troubleshooting Guide<br />

Sun Microsystems, Inc.<br />

901 San Antonio Road<br />

Palo Alto, CA 94303<br />

U.S.A.<br />

Part No: 805-4036–10<br />

October, 1998


Copyright 1998 Sun Microsystems, Inc. 901 San Antonio Road, Palo Alto, California 94303-4900 U.S.A. All rights reserved.<br />

This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and<br />

decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of<br />

Sun and its licensors, if any. Third-party software, including font technology, is copyrighted and licensed from Sun suppliers.<br />

Parts of the product may be derived from Berkeley BSD systems, licensed from the University of California. UNIX is a registered<br />

trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd.<br />

Sun, Sun Microsystems, the Sun logo, SunSoft, SunDocs, SunExpress, and Solaris are trademarks, registered trademarks, or service marks<br />

of Sun Microsystems, Inc. in the U.S. and other countries. All SPARC trademarks are used under license and are trademarks or registered<br />

trademarks of SPARC International, Inc. in the U.S. and other countries. Products bearing SPARC trademarks are based upon an<br />

architecture developed by Sun Microsystems, Inc.<br />

The OPEN LOOK and Sun TM Graphical User Interface was developed by Sun Microsystems, Inc. for its users and licensees. Sun<br />

acknowledges the pioneering efforts of Xerox in researching and developing the concept of visual or graphical user interfaces for the<br />

computer industry. Sun holds a non-exclusive license from Xerox to the Xerox Graphical User Interface, which license also covers Sun’s<br />

licensees who implement OPEN LOOK GUIs and otherwise comply with Sun’s written license agreements.<br />

RESTRICTED RIGHTS: Use, duplication, or disclosure by the U.S. Government is subject to restrictions of FAR 52.227–14(g)(2)(6/87) and<br />

FAR 52.227–19(6/87), or DFAR 252.227–7015(b)(6/95) and DFAR 227.7202–3(a).<br />

DOCUMENTATION IS PROVIDED “AS IS” AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES,<br />

INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE OR<br />

NON-INFRINGEMENT, ARE DISCLAIMED, EXCEPT TO THE EXTENT THAT SUCH DISCLAIMERS ARE HELD TO BE LEGALLY<br />

INVALID.<br />

Copyright 1998 Sun Microsystems, Inc. 901 San Antonio Road, Palo Alto, Californie 94303-4900 Etats-Unis. Tous droits réservés.<br />

Ce produit ou document est protégé par un copyright et distribué avec des licences qui en restreignent l’utilisation, la copie, la<br />

distribution, et la décompilation. Aucune partie de ce produit ou document ne peut être reproduite sous aucune forme, par quelque<br />

moyen que ce soit, sans l’autorisation préalable et écrite de Sun et de ses bailleurs de licence, s’il y en a. Le logiciel détenu par des tiers, et<br />

qui comprend la technologie relative aux polices de caractères, est protégé par un copyright et licencié par des fournisseurs de Sun.<br />

Des parties de ce produit pourront être dérivées du système Berkeley BSD licenciés par l’Université de Californie. UNIX est une marque<br />

déposée aux Etats-Unis et dans d’autres pays et licenciée exclusivement par X/Open Company, Ltd.<br />

Sun, Sun Microsystems, le logo Sun, SunSoft, SunDocs, SunExpress, et Solaris sont des marques de fabrique ou des marques déposées, ou<br />

marques de service, de Sun Microsystems, Inc. aux Etats-Unis et dans d’autres pays. Toutes les marques SPARC sont utilisées sous licence<br />

et sont des marques de fabrique ou des marques déposées de SPARC International, Inc. aux Etats-Unis et dans d’autres pays. Les produits<br />

portant les marques SPARC sont basés sur une architecture développée par Sun Microsystems, Inc.<br />

L’interface d’utilisation graphique OPEN LOOK et Sun TM<br />

a été développée par Sun Microsystems, Inc. pour ses utilisateurs et licenciés.<br />

Sun reconnaît les efforts de pionniers de Xerox pour la recherche et le développement du concept des interfaces d’utilisation visuelle ou<br />

graphique pour l’industrie de l’informatique. Sun détient une licence non exclusive de Xerox sur l’interface d’utilisation graphique Xerox,<br />

cette licence couvrant également les licenciés de Sun qui mettent en place l’interface d’utilisation graphique OPEN LOOK et qui en outre<br />

se conforment aux licences écrites de Sun.<br />

CETTE PUBLICATION EST FOURNIE “EN L’ETAT” ET AUCUNE GARANTIE, EXPRESSE OU IMPLICITE, N’EST ACCORDEE, Y<br />

COMPRIS DES GARANTIES CONCERNANT LA VALEUR MARCHANDE, L’APTITUDE DE LA PUBLICATION A REPONDRE A UNE<br />

UTILISATION PARTICULIERE, OU LE FAIT QU’ELLE NE SOIT PAS CONTREFAISANTE DE PRODUIT DE TIERS. CE DENI DE<br />

GARANTIE NE S’APPLIQUERAIT PAS, DANS LA MESURE OU IL SERAIT TENU JURIDIQUEMENT NUL ET NON AVENU.<br />

Please<br />

Recycle


Contents<br />

Preface<br />

xxi<br />

1. About Error Messages 1<br />

Searching for Messages 1<br />

Choosing What To Look For 1<br />

In the Printed Book 2<br />

In the AnswerBook Navigator 2<br />

Combining Search Techniques 4<br />

Understanding the Message Explanations 4<br />

2. Alphabetical Message Listing 7<br />

Numbers and Symbols 7<br />

***** FILE SYSTEM WAS MODIFIED ***** 7<br />

** Phase 1-- Check Blocks and Sizes 8<br />

** Phase 1b-- Rescan For More DUPS 8<br />

** Phase 2-- Check Pathnames 9<br />

** Phase 3-- Check Connectivity 9<br />

** Phase 4-- Check Reference Counts 10<br />

** Phase 5-- Check Cyl groups 10<br />

29a00 illegal instruction 11<br />

451 timeout waiting for input during source 12<br />

Contents<br />

iii


"A" 14<br />

"B" 21<br />

550 hostname... Host unknown 12<br />

550 username... User unknown 13<br />

554 hostname ... Local configuration error 13<br />

Accessing a corrupted shared library 14<br />

access violation unknown host IP address 15<br />

A command window has exited because<br />

its child exited. 15<br />

Address already in use 16<br />

Address family not supported by protocol family 16<br />

admintool: Received communication service error 4 16<br />

Advertise error 17<br />

answerbook: XView error: NULL pointer<br />

passed to xv_set 17<br />

Arg list too long 18<br />

Argument out of domain 18<br />

Arguments too long 19<br />

assertion failed: <strong>string</strong>, file name, line int 19<br />

Attempting to link in more shared<br />

libraries than system limit 20<br />

automountd[int]: server hostname responding 20<br />

automount[int]: name: Not a directory 21<br />

Bad address 21<br />

BAD/DUP FILE I=i OWNER=o MODE=m SIZE=s<br />

MTIME=t CLEAR? 22<br />

Bad file number 22<br />

block no. BAD I=inode no. 22<br />

BAD_MESSAGE (error code 100) from X.400 23<br />

iv Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"C" 30<br />

bad module/chip at: position 23<br />

Bad request descriptor 24<br />

BAD SUPER BLOCK: <strong>string</strong> 24<br />

BAD TRAP 26<br />

/bin/sh: file: too big 26<br />

Block device required 27<br />

Boot device: /iommu/sbus/ directory/directoryliteral>/<br />

sd@3,0 27<br />

boot error 13 | sun4m 28<br />

Broadcast Message from root (pts/int)<br />

on server [date] 28<br />

Broken pipe 29<br />

Bus Error 29<br />

Cannot access a needed shared library 30<br />

Cannot allocate colormap entry for " <strong>string</strong> " 30<br />

Cannot assign requested address 30<br />

cannot change passwd, not correct passwd 31<br />

Cannot exec a shared library directly 31<br />

Cannot find SERVER hostname in network database 32<br />

Cannot open FCC file 33<br />

Cannot send after transport endpoint shutdown 33<br />

Can’t create public message device (Device busy) 33<br />

Can’t invoke /etc/init, error int 34<br />

can’t open /dev/rdsk/<strong>string</strong>: (null):<br />

UNEXPECTED INCONSISTENCY 35<br />

can’t synchronize with hayes 35<br />

cd: Too many arguments 36<br />

Channel number out of range 36<br />

Contents<br />

v


"D" 47<br />

chmod: ERROR: invalid mode 36<br />

Command not found 37<br />

Communication error on send 38<br />

Connection closed. 38<br />

Connection closed by foreign host. 38<br />

[Connection closed. Exiting] 39<br />

Connection refused 39<br />

Connection reset by peer 39<br />

Connection timed out 40<br />

console login: ^J^M^Q^K^K^P 40<br />

core dumped 41<br />

corrupt label - wrong magic number or<br />

corrupt label - label checksum failed 42<br />

could not grant slave pty 43<br />

Could not initialize tooltalk (tt_open):<br />

TT_ERR_NOMP 44<br />

Could not open ToolTalk Channel 44<br />

Could not start new viewer 45<br />

cpio: Bad magic number/header. 45<br />

cpio : can’t read input : end of file<br />

encountered prior to expected end of archive. 46<br />

Cross-device link 46<br />

data access exception 47<br />

Data fault 47<br />

Deadlock situation detected/avoided 48<br />

Destination address required 48<br />

/dev/fd/int: /dev/fd/int: cannot open 49<br />

/dev/rdsk/c0t6d0s2: No such file or directory 50<br />

vi Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"E" 58<br />

Device busy 50<br />

device busy 51<br />

/dev/rdsk/<strong>string</strong>: CAN’T CHECK FILE SYSTEM. 52<br />

/dev/rdsk/<strong>string</strong>: UNEXPECTED<br />

INCONSISTENCY; RUN fsck MANUALLY. 53<br />

Directory not empty 53<br />

diskN not unique 54<br />

Disc quota exceeded 54<br />

driver is already installed 55<br />

DUMP: Cannot open dump device<br />

‘/dev/rdsk/c2t0d0s1’: Permission denied 55<br />

dumptm: Cannot open ‘/dev/rmt/<strong>string</strong>’: Device busy 56<br />

DUP/BAD I=i OWNER=o MODE=m SIZE=s<br />

MTIME=t FILE=f REMOVE? 57<br />

int DUP I=int 57<br />

ENOMEM The available data space is not<br />

large enough to accommodate the shared<br />

memory segment 58<br />

error 13 59<br />

error 15 initializing 59<br />

Error 76 60<br />

Error 88 60<br />

error: DPS has not initialized or<br />

server connection failed 60<br />

Error: Error adding OS service<br />

Solaris 2.6 sparc sun4u: 61<br />

Error Host Unknown: 62<br />

ERROR: missing file arg (cm3) 62<br />

ERROR [SCCS/s.<strong>string</strong>]: ‘SCCS/p.<strong>string</strong>’<br />

nonexistent (ut4) 63<br />

Contents<br />

vii


"F" 66<br />

"G" 76<br />

ERROR [SCCS/s.<strong>string</strong>]: writable ‘<strong>string</strong>’ exists (ge4) 63<br />

Error: you don’t have a license to<br />

run this program 63<br />

esp0: data transfer overrun 64<br />

Event not found 64<br />

EXCESSIVE BAD BLKS I=int CONTINUE? 65<br />

EXCESSIVE DUP BLKS I=int CONTINUE? 65<br />

Exec format error 66<br />

failed to initialize adapter 66<br />

fbconsole: ioctl SRIOCSREDIR: Device Busy. 67<br />

fd0: unformatted diskette or no<br />

diskette in the drive 68<br />

File descriptor in bad state 68<br />

File exists 68<br />

File locking deadlock 69<br />

filemgr: mknod: Permission denied 69<br />

File name too long 70<br />

file system full 70<br />

FILE SYSTEM STATE IN SUPERBLOCK IS WRONG; FIX? 71<br />

File table overflow 71<br />

File too large 72<br />

FREE BLK COUNT(S) WRONG IN SUPERBLK SALVAGE? 72<br />

fsck: Can’t open /dev/dsk/<strong>string</strong> 73<br />

fsck: Can’t stat /dev/dsk/<strong>string</strong> 73<br />

ftp: ftp/tcp: unknown service 74<br />

fw_ipinput: q fc5fddc0:illegal interface 74<br />

fwm: no license 75<br />

viii Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"H" 77<br />

"I" 80<br />

giving up 76<br />

Graphics Adapter device /dev/fb is of unknown type 76<br />

group.org_dir: NIS+ servers unreachable 77<br />

hang console 77<br />

/home/<strong>string</strong>: No such file or directory 78<br />

Host is down 78<br />

host name configuration error 79<br />

hosts.org_dir: NIS+ servers unreachable 79<br />

I can’t read your attachments. What<br />

mailer are you using? 80<br />

Identifier removed 80<br />

ie0: Ethernet jammed 81<br />

ie0: no carrier 81<br />

ifconfig: bad address 82<br />

ifconfig bad address le0 82<br />

If pipe/FIFO, don’t sleep in stream head 83<br />

Illegal Instruction 83<br />

Illegal instruction "0xhex" was<br />

encountered at PC 0xhex 84<br />

Illegal seek 84<br />

Image Tool: Unable to open XIL Library. 85<br />

Inappropriate ioctl for device 85<br />

INCORRECT BLOCK COUNT I=int (should be int) CORRECT? 86<br />

index failed:full:index preceded by saveset name 86<br />

inetd[int]: execv /usr/sbin/in.uucpd:<br />

No such file or directory 87<br />

inetd[int]: <strong>string</strong>/tcp: unknown service 87<br />

Contents<br />

ix


"J" 93<br />

"K" 94<br />

"L" 95<br />

inetd[int]: <strong>string</strong>/udp: unknown service 88<br />

inetd: Too many open files 88<br />

INIT: Cannot create /var/adm/utmp or<br />

/var/adm/utmpx 88<br />

InitOutput: Error loading module for /dev/fb 89<br />

Interrupted system call 90<br />

Invalid argument 90<br />

Invalid null command 91<br />

Invalid_SS_JWS_HOME:no<br />

C:\\lib\basicframe.properties 91<br />

I/O error 92<br />

Is a directory 92<br />

java.lang.UnsatisfiedLinkError: 93<br />

kernel read error 94<br />

Killed 94<br />

kmem_free block already free 95<br />

last message repeated int times 95<br />

ld.so.1 fatal: can’t set protection on segment 96<br />

ld.so.1: <strong>string</strong>: fatal: <strong>string</strong>: can’t open<br />

file: errno=2 96<br />

ld.so.1: <strong>string</strong>: fatal: <strong>string</strong>: open<br />

failed: No such file or directory 97<br />

ld.so.1: <strong>string</strong>: fatal: relocation<br />

error: symbol not found: <strong>string</strong> 97<br />

ld.so.1: <strong>string</strong>: fatal: relocation error:<br />

<strong>string</strong>: <strong>string</strong>: referenced symbol not found 98<br />

le0: Memory error! 99<br />

x Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


le0: No carrier-- cable disconnected<br />

or hub link test disabled? 99<br />

le0: No carrier-- transceiver cable problem? 100<br />

level 15 interrupt 101<br />

.lib section in a.out corrupted 101<br />

LINK COUNT FILE I=i OWNER=o MODE=m<br />

SIZE=s MTIME=t COUNT... ADJUST? 101<br />

Link has been severed 102<br />

LL105W: Protocol error detected. 102<br />

ln: cannot create /dev/fb: Read-only file system 102<br />

lockd[int]: create_client: no name for<br />

inet address 0xhex 103<br />

Login incorrect 103<br />

lp hang 104<br />

"M" 105<br />

Machine is not on the network 105<br />

mailtool: Can’t create dead letter:<br />

Permission denied 105<br />

mailtool: Could not initialize the<br />

Classing Engine 106<br />

Mail Tool is confused about the state<br />

of your Mail File. 106<br />

mail: Your mailfile was found to be<br />

corrupted (Content-length mismatch). 107<br />

Machine is not on the network 107<br />

mailtool: Can’t create dead letter:<br />

Permission denied 108<br />

mailtool: Could not initialize the<br />

Classing Engine 108<br />

Mail Tool is confused about the state<br />

of your Mail File. 109<br />

Contents<br />

xi


mail: Your mailfile was found to be<br />

corrupted (Content-length mismatch). 109<br />

file name may contain holes - can’t swap on it. 110<br />

mbuf map full 110<br />

Memory address alignment 110<br />

memory leaks 111<br />

Message too long 111<br />

mount: /dev/dsk/<strong>string</strong> is already<br />

mounted, /<strong>string</strong> is busy, or... 112<br />

mount: giving up on: /<strong>string</strong> 112<br />

mount: mount-point /<strong>string</strong> does not exist. 113<br />

mount: the state of /dev/dsk/<strong>string</strong> is not okay 113<br />

Multihop attempted 114<br />

mbuf map full 114<br />

Memory address alignment 114<br />

memory leaks 114<br />

Message too long 115<br />

mount: /dev/dsk/<strong>string</strong> is already<br />

mounted, /<strong>string</strong> is busy, or... 115<br />

mount: giving up on: /<strong>string</strong> 116<br />

mount: mount-point /<strong>string</strong> does not exist. 116<br />

mount: the state of /dev/dsk/<strong>string</strong> is not okay 117<br />

Multihop attempted 117<br />

"N" 118<br />

Name not unique on network 118<br />

named [pid]: hostname.domainname has CNAME<br />

and other data (illegal) 118<br />

/net/<strong>string</strong>: No such file or directory 119<br />

Network dropped connection because of reset 120<br />

xii Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Network is down 120<br />

Network is unreachable 120<br />

NFS getattr failed for server <strong>string</strong>:<br />

RPC: Timed out 121<br />

nfs mount: Couldn’t bind to reserved port 122<br />

NFS mounted callog file Unsupported. 122<br />

nfs mount: mount: <strong>string</strong>: Device busy 123<br />

NFS mount: /<strong>string</strong> mounted OK 123<br />

NFS read failed for server <strong>string</strong> 124<br />

nfs_server: bad getargs for int/int 124<br />

NFS server <strong>string</strong> not responding still trying 125<br />

NFS server <strong>string</strong> ok 125<br />

NFS <strong>string</strong> failed for server <strong>string</strong>: error int (<strong>string</strong>) 126<br />

nfs umount: <strong>string</strong>: is busy 126<br />

NFS write error on host <strong>string</strong>: No space<br />

left on device. 126<br />

NFS write failed for server <strong>string</strong>: RPC: Timed out 127<br />

NIS+ authentication failure 128<br />

nis_cachemgr: Error in reading NIS cold start file :<br />

’/var/nis/NIS_COLD_START’ 128<br />

No buffer space available 129<br />

No child processes 129<br />

No default media available 130<br />

No directory! Logging in with home=/ 130<br />

No message of desired type 131<br />

No recipients specified 132<br />

No record locks available 132<br />

No route to host 132<br />

No shell Connection closed 133<br />

Contents<br />

xiii


No space left on device 133<br />

No such device 134<br />

No such device or address 134<br />

No such file or directory 135<br />

no such map in server’s domain 135<br />

No such process 136<br />

No such user as <strong>string</strong>-- cron entries not created 136<br />

Not a data message 137<br />

Not a directory 137<br />

Not a stream device 138<br />

Not enough space 138<br />

not found 139<br />

NOTICE: vxvm: unexpected status on close 139<br />

NOTICE: /<strong>string</strong>: out of inodes 141<br />

Not login shell 141<br />

Not on system console 142<br />

Not owner 142<br />

Not supported 142<br />

No utmpx entry 143<br />

6/04/98 7:27:54 nsrck: SYSTEM error,<br />

more space needed to compress [client]<br />

index, 8.1 MB required 144<br />

"O" 145<br />

Object is remote 145<br />

ok 145<br />

Operation already in progress 146<br />

Operation canceled 146<br />

operation failed [error 185], unknown<br />

group error 0, <strong>string</strong> 146<br />

xiv Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Operation not applicable 147<br />

"P" 150<br />

Operation not supported on transport endpoint 147<br />

Operation now in progress 148<br />

/opt/bin/jws: /solaris/bin/locate_dirs: not found 148<br />

Option not supported by protocol 148<br />

out of memory 149<br />

Out of stream resources 149<br />

overlapping swap volume 149<br />

Package not installed 150<br />

panic -boot: Could not mount filesystem 150<br />

panic: mutex_adaptive_exit 152<br />

Panic 152<br />

PARTIALLY ALLOCATED INODE I=int CLEAR? 153<br />

passwd: Changing password for <strong>string</strong> 154<br />

passwd.org_dir: NIS+ servers unreachable 155<br />

Password does not decrypt secret key<br />

for unix.uid@<strong>string</strong> 155<br />

"Q" 159<br />

"R" 159<br />

Permission denied 156<br />

Please specify a recipient. 156<br />

Protocol error 156<br />

protocol error, <strong>string</strong> closed connection 157<br />

Protocol family not supported 157<br />

Protocol not supported 158<br />

Protocol wrong type for socket 158<br />

quotactl: open Is a directory 159<br />

Contents<br />

xv


Read error from network: Connection reset by peer 159<br />

"S" 166<br />

Read-only file system 160<br />

rebooting... 160<br />

Recipient names must be specified 160<br />

Reset tty pgrp from int to int 161<br />

Resource temporarily unavailable 161<br />

Restartable system call 162<br />

Result too large 162<br />

rlogin: no directory! connection closed 163<br />

rmdir: <strong>string</strong>: Directory not empty 164<br />

ROOT LOGIN /dev/console 164<br />

ROOT LOGIN /dev/pts/int FROM <strong>string</strong> 164<br />

RPC: Program not registered 165<br />

rx framing error 165<br />

save: SYSTEM error, Arg list too long 166<br />

SCSI bus DATA IN phase parity error 166<br />

SCSI transport failed: reason ’reset’ 167<br />

Security exception on host <strong>string</strong>. USER<br />

ACCESS DENIED. 168<br />

Segmentation Fault 168<br />

sendmail[]: can’t lookup data via name<br />

sendmail[]: can’t lookup data via name<br />

server "nis" 169<br />

server "dns" or<br />

sendmail[int]: NOQUEUE: SYSERR: net hang<br />

reading from <strong>string</strong> 170<br />

Service wouldn’t let us acquire selection 170<br />

setmnt: Cannot open /etc/mnttab for writing 171<br />

share_nfs: /home: Operation not applicable 171<br />

xvi Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Slice c0t1d0s0 is too small to contain 1 replicas 171<br />

Socket type not supported 172<br />

Soft error rate (int%) during writing was too high 172<br />

Soft error rate (retries = int) during<br />

was too high 173<br />

writing<br />

"T" 180<br />

Software caused connection abort 173<br />

Srmount error 174<br />

Stale NFS file handle 174<br />

statd: cannot talk to statd at <strong>string</strong> 175<br />

stty: TCGETS: Operation not supported on socket 175<br />

su: No shell 176<br />

SunPC may NOT run correctly as root 177<br />

su: ’su root’ failed for <strong>string</strong> on /dev/pts/int 177<br />

su: ’su root’ succeeded for <strong>string</strong> on /dev/pts/int 178<br />

syncing file systems... 178<br />

SYSLOGD CAUSES SYSTEM HANGS 179<br />

syslog service starting. 179<br />

System booting after fatal error FATAL 179<br />

system hang 180<br />

SYSTEM HANGS DURING BOOT 180<br />

tar: /dev/rmt/0: No such file or directory 180<br />

tar: directory checksum error 181<br />

tar: tape write error 181<br />

Text file busy 182<br />

Text is lost because the maximum edit<br />

log size has been exceeded. 182<br />

THE FOLLOWING FILE SYSTEM(S) HAD AN<br />

UNEXPECTED INCONSISTENCY: 182<br />

Contents<br />

xvii


The SCSI bus is hung. Perhaps an<br />

external device is turned off. 183<br />

THE SYSTEM IS BEING SHUT DOWN NOW !!! 184<br />

The system will be shut down in int minutes 184<br />

This gateway does not support Unix Password. 185<br />

This mail file has been changed by<br />

another mail reader. 185<br />

"U" 190<br />

Timeout waiting for ARP/RARP packet 186<br />

Timer expired 186<br />

token ring hangs 187<br />

Too many links 187<br />

Too many open files 187<br />

Transport endpoint is already connected 188<br />

Transport endpoint is not connected 188<br />

TRAP 3E 189<br />

ufsdump 4mm commands 190<br />

umount: warning: /<strong>string</strong> not in mnttab 190<br />

Unable to connect to license server.<br />

Inconsistent encryption code. 191<br />

unable to get pty! 191<br />

Unable to install/attach driver ’<strong>string</strong>’ 192<br />

Unable to open nwrecover, Error:<br />

nwrecover: NSR: please start a server on client_name 192<br />

uname: error writing name when booting 193<br />

undefined control 193<br />

Unmatched ‘ 193<br />

UNREF FILE I=i OWNER=o MODE=m SIZE=s<br />

MTIME=t CLEAR? 194<br />

Use "logout" to logout. 194<br />

xviii Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


user unknown 195<br />

/usr/dt/bin/rpc.ttdbserverd:Child Status’ changed 195<br />

/usr/openwin/bin/xinit: connection to<br />

X server lost 196<br />

/usr/ucb/cc: language optional software<br />

package not installed 196<br />

UX: userdel: error: Cannot update<br />

system files login cannot be deleted 197<br />

"V" 198<br />

Value too large for defined data type 198<br />

Volume Manager reports error: 198<br />

vxconfigd error: segmentation fault 199<br />

vxvm:vxslicer:ERROR unsupported disk layout 200<br />

"W" 200<br />

WARNING: add_spec: No major number for sf 200<br />

WARNING: Clock gained int days-- CHECK<br />

AND RESET THE DATE! 201<br />

WARNING: No network locking on <strong>string</strong>:<br />

contact admin to install server change 202<br />

WARNING: processor level 4 interrupt not serviced 202<br />

WARNING: /tmp: File system full, swap<br />

space limit exceeded 203<br />

WARNING: TOD clock not initialized--<br />

CHECK AND RESET THE DATE! 203<br />

WARNING: Unable to repair the /<br />

filesystem. Run fsck 204<br />

"X" 206<br />

Watchdog Reset 204<br />

Who are you? 205<br />

Window Underflow 205<br />

X connection to <strong>string</strong>:0.0 broken<br />

(explicit kill or server shutdown). 206<br />

Contents<br />

xix


xinit: not found 207<br />

XIO: fatal IO error 32 (Broken pipe)<br />

on X server "<strong>string</strong>:0.0" 207<br />

Xlib: connection to "<strong>string</strong>:0.0" refused by server 207<br />

Xlib: extension "GLX" missing on display "0.0" 208<br />

xterm: fatal IO error 32 (Broken Pipe)<br />

or KillClient on X server "<strong>string</strong>:0.0" 209<br />

XView warning: Cannot load font set<br />

’<strong>string</strong>’ (Font Package) 210<br />

"Y" 210<br />

yp_all RPC clnt_call (transport level) failure 210<br />

ypbind[int]: NIS server for domain "<strong>string</strong>" OK 211<br />

ypbind[int]: NIS server not responding for domain "<br />

<strong>string</strong> "; still trying 211<br />

"Z" 212<br />

ypwhich: can’t communicate with ypbind 212<br />

zsint: silo overflow 212<br />

xx Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Preface<br />

System administrators and advanced users can use the Common Messages and<br />

Troubleshooting Guide to find explanations of some of the more common error<br />

messages in the Solaris system.<br />

Look up the messages and explanations here when you see a system message that<br />

you don’t understand. If the message you’re searching for is fairly common, it might<br />

be documented in this book.<br />

How This Book Is Organized<br />

Chapter 1,” explains how to find messages in both the AnswerBook Navigator and<br />

in the printed book.<br />

Chapter 2,” lists messages alphabetically, with troubleshooting information following<br />

each message listing.<br />

Special Symbols<br />

Three special symbols are used with the message explanations in this book.<br />

Preface<br />

xxi


See Also<br />

When further reading is suggested for a topic mentioned in the message explanation,<br />

the v symbol appears next to the first mention of the topic. This tells you to look in<br />

the message “See Also” section for sources of more information.<br />

New Line<br />

This symbol means that the part of this multi-line message following ¿ appears on a<br />

separate line.<br />

Deciphering Type Changes And Prompt<br />

Symbols<br />

The following table describes the typographic changes used in this book.<br />

AaBbCc123<br />

Variables, book titles, words<br />

to be emphasized<br />

To delete a file, type rm filename.<br />

Read Chapter 6 in User’s Guide.<br />

TABLE P–1<br />

Typographic Conventions<br />

Typeface or<br />

Symbol<br />

AaBbCc123<br />

Meaning<br />

Functions, commands,<br />

filenames, code, screen<br />

displays<br />

Example<br />

Edit your .login file.<br />

Use ls -a to list all files.<br />

The setlogmask() function sets...<br />

AaBbCc123<br />

name(num)<br />

What you type, contrasted<br />

with screen displays<br />

The manual page in the<br />

Solaris 2.7 Reference Manual<br />

AnswerBook<br />

machine_name% su<br />

Password:<br />

See ls(1).<br />

% C shell prompt %ls-a<br />

xxii Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


TABLE P–1 Typographic Conventions (continued)<br />

Typeface or<br />

Symbol<br />

Meaning<br />

Example<br />

$ Bourne or Korn shell prompt $ ls -a<br />

# Superuser prompt # ls -a<br />

xxiii


CHAPTER 1<br />

About Error Messages<br />

This book covers some of the more common error messages in the Solaris 7operating<br />

environment. Most messages covered here come from the operating system and the<br />

window system, but some come from commands, networking, and system<br />

administration (the section 1 and section 1M man pages).<br />

Searching for Messages<br />

Choosing What To Look For<br />

How you choose to look up a particular message depends on<br />

4 how the message is constructed<br />

4 whether you are searching in a printed book or in the AnswerBook documentation<br />

Variable Words and Numbers<br />

Remember as you are searching that some words and numbers in messages vary<br />

when the messages are displayed. For example, the following message uses the name<br />

of the server affected, b5server in this case:<br />

NFS read failed for server b5server<br />

When message words or numbers vary, this book uses the words variable and number<br />

in the italics type face. So the previous message is listed in this book as:<br />

NFS read failed for server variable<br />

1


Variable words and numbers can appear anywhere in a message, even at the<br />

beginning. Because of this, messages are alphabetized by the first nonreplaced word<br />

or number in the message.<br />

Frequently Duplicated Parts of Messages<br />

Many messages you see are actually combined messages, often beginning with a<br />

program name. The five error messages in the following example are basically the<br />

same even though the command names are different.<br />

4 find: out of memory<br />

4 grep: out of memory<br />

4 ls: out of memory<br />

4 mount: out of memory<br />

4 fsck: out of memory<br />

Rather than document this message at least five times, it appears in this book as the<br />

message “out of memory.” Messages that contain colons (:) are often combined<br />

messages, and you might find that explanations of message sections are available<br />

separately.<br />

So, if you don’t find the beginning of a message in the book, and the message<br />

contains colons, search for other parts of the message.<br />

In the Printed Book<br />

Methods for finding a particular message vary depending on whether you are<br />

looking at a printed book or are searching online with the AnswerBook Navigator.<br />

To find a message in the printed book, you can search the table of contents (which is<br />

an alphabetical listing of the messages) or the main body of the manual, Chapter 2.<br />

In the AnswerBook Navigator<br />

While print search methods work in AnswerBook, too, it’s much faster to search for<br />

messages through the search utility in the AnswerBook Navigator.<br />

1. Bring up AnswerBook<br />

$ answerbook<br />

2. Click Select on the Search button<br />

3. Enter the words or pattern to search for in the “Search Library For:” pane<br />

2 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


4. Double-click Select on an entry in the resulting list. Although any of the entries<br />

might contain the information you’re looking for, those from this book are most<br />

likely to be what you want.<br />

If your first search doesn’t find the message, consider altering the search pattern.<br />

Remember that this book contains only a small percentage of possible messages.<br />

In general, you are most likely to find a documented message in the AnswerBook<br />

search pane when you enclose the searched-for words in quotation marks (““) or in<br />

parentheses ().<br />

Using Pattern Matching<br />

You can search in the AnswerBook Navigator for text containing specific single<br />

words, phrases that contain spaces, words near one another, and word variations.<br />

See“Using the AnswerBook Software” in OpenWindows User’s Guide for more<br />

detailed information about the AnswerBook search.<br />

TABLE 1–1<br />

AnswerBook Search Pattern Matching<br />

To search for Such as Use<br />

Single words installing, le0, group The words<br />

Phrases with spaces Installing Packages Quotation marks (““) around<br />

the phrase<br />

Words near one another Installing...Server Parentheses ( ( ))<br />

Word variations delete, deleting, deletion Asterisks (*) and hyphens (-)<br />

The following example shows some of the possible matches for specific AnswerBook<br />

Navigator searches.<br />

About Error Messages 3


TABLE 1–2<br />

AnswerBook Search Results<br />

Searching With<br />

Installing<br />

“Installing Packages”<br />

(Installing Server)<br />

Delet*<br />

Finds These (for Example)<br />

Installing XIL Device Handlers (XIL Device Porting and<br />

Extensibility Guide)<br />

Installing Packages on a Server for... (Application Packaging<br />

Developer’s Guide)<br />

Installing Packages on a Server for... (Application Packaging<br />

Developer’s Guide)<br />

Installing Packages for Clients on a Server (Software and<br />

AnswerBook Packages...)<br />

Installing Packages on a Server for... (Application Packaging<br />

Developer’s Guide)<br />

Creating an Install Server (SPARC Installing Solaris Software)<br />

Delete All Silence (Solaris Advanced User’s Guide)<br />

Deleting a Line (Solaris Advanced User’s Guide)<br />

Deletion of the New Selection (OLIT Reference Manual)<br />

If you are unable to find an error message documented, please report it to us by<br />

sending e-mail to msgdoc@Eng.Sun.COM (this address is an autoresponder alias, not<br />

an actual person).<br />

Combining Search Techniques<br />

Combine the above search techniques to further refine your search. For example,<br />

“chang* mail-tool” finds documents containing phrases such as “change mailtool,”<br />

“change mail tool,” “change mail-tool,” “changing mailtool,” and so on.<br />

Understanding the Message<br />

Explanations<br />

Each message contains at least one of the following areas:<br />

4 Cause: What might have happened to cause the message<br />

4 Action: What you can do to fix the problem or, to continue with your work<br />

4 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


4 Technical Notes: Background information that might be interesting or helpful to a<br />

technical audience. This often contains information specifically for programmers<br />

Whenever you see part of a message that says “errno=” and then a number, look<br />

up the number on the Intro(2) man page to see what it indicates. System error<br />

messages on the Intro(2) man page are organized numerically.<br />

4 See Also: Suggests further reading<br />

About Error Messages 5


6 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


CHAPTER 2<br />

Alphabetical Message Listing<br />

Messages listed here are in the current Solaris 7 system. Many were also present in<br />

earlier Solaris 2 systems.<br />

Messages are listed alphabetically.<br />

Numbers and Symbols<br />

***** FILE SYSTEM WAS MODIFIED *****<br />

Cause<br />

This comment from the fsck(1M) command tells you that it changed the filesystem<br />

it was checking.<br />

Action<br />

If fsck(1M) was checking the root filesystem, reboot the system immediately to<br />

avoid corrupting the / partition. If fsck(1M) was checking a mounted filesystem,<br />

unmount that filesystem and run fsck(1M) again, so that work done by fsck(1M)<br />

is not undone when in-memory file tables are written out to disk.<br />

7


** Phase 1-- Check Blocks and Sizes<br />

Cause<br />

The fsck(1M) command is checking the filesystem shown in the messages that are<br />

displayed before this one. The first phase checks the inode list, finds bad or duplicate<br />

blocks, and verifies the inode size and format.<br />

Action<br />

If more than a dozen errors occur during this important phase, you might want to<br />

restore the filesystem from backup tapes. Otherwise it is fine to proceed with<br />

fsck(1M).<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

** Phase 1b-- Rescan For More DUPS<br />

Cause<br />

The fsck(1M) command detected duplicate blocks while checking a filesystem, so<br />

fsck(1M) is rescanning the filesystem to find the inode that originally claimed that<br />

block.<br />

Action<br />

If fsck(1M) executes this optional phase, you will see additional DUP/BAD<br />

messages in phases 2 and 4.<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

8 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


** Phase 2-- Check Pathnames<br />

Cause<br />

The fsck(1M) command is checking a filesystem, and fsck(1M) is now removing<br />

directory entries pointing to bad inodes that were discovered in phases 1 and 1b.<br />

This phase might ask you to remove files, salvage directories, fix inodes, reallocate<br />

blocks, and so on.<br />

Action<br />

If more than a dozen errors occur during this important phase, you might want to<br />

restore the filesystem from backup tapes. Otherwise it is fine to proceed with<br />

fsck(1M).<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

** Phase 3-- Check Connectivity<br />

Cause<br />

The fsck(1M) command is checking a filesystem, and fsck(1M) is now verifying<br />

the integrity of directories. You might be asked to adjust, create, expand, reallocate,<br />

or reconnect directories.<br />

Action<br />

You can usually answer yes to all these questions without harming the filesystem.<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

Alphabetical Message Listing 9


** Phase 4-- Check Reference Counts<br />

Cause<br />

The fsck(1M) command is checking a filesystem, and fsck(1M) is now checking<br />

link count information obtained in phases 2 and 3. You might be asked to clear or<br />

adjust link counts.<br />

Action<br />

You can usually answer yes to all these questions without harming the filesystem.<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

** Phase 5-- Check Cyl groups<br />

Cause<br />

The fsck(1M) command is checking a filesystem, and fsck(1M) is now checking<br />

the free-block and used-inode maps. You might be asked to salvage free blocks or<br />

summary information.<br />

Action<br />

You can usually answer yes to all these questions without harming the filesystem.<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

10 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


29a00 illegal instruction<br />

Cause<br />

When trying to boot a client from a boot/jumpstart server to install/upgrade a<br />

workstation, it fails with the following message:<br />

boot net - install<br />

Rebooting with command: net - install<br />

Boot device: /iommu/sbus/ledma@f, 400010/le@f, 8c0000 File and args: -<br />

install<br />

29a00 Illegal Instruction<br />

(0) ok<br />

Action<br />

The problem lies in the /tftpboot directory of the boot server. Confirm that the<br />

HOSTID and HOSTID.ARCH files are linked to the correct inetboot file for your<br />

architecture. The following is an example of how a symbolic link should look:<br />

# cd /tftpboot<br />

# ls -l 81971904*<br />

81971904 -> inetboot.sun4m.Solaris_2.4<br />

81971904.SUN4M -> inetboot.sun4m.Solaris_2.4<br />

The following is an example of an incorrect entry for a sun4m system:<br />

C753002F -> inetboot.axil4m.Solaris_2.5.1<br />

C753002F.AXIL4M -> inetboot.axil4m.Solaris_2.5.1<br />

If the entries are not correct, remove the entry for the particular client in this<br />

directory, using rm_install_client or rm_client commands, and re-add the<br />

client with the add_install_client(1M) or add_client command or through<br />

Solstice giving the correct architecture.<br />

Alphabetical Message Listing 11


451 timeout waiting for input during<br />

source<br />

Cause<br />

When sendmail(1M) reads from anything that might time out, such as an SMTP<br />

connection, it sets a timer to the value of the r processing option before reading<br />

begins. If the read doesn’t complete before the timer expires, this message appears<br />

and reading stops. (Usually this is during RCPT.) The mail message is then queued<br />

for later delivery.<br />

Action<br />

If you see this message often, increase the value of the r processing option in the<br />

/etc/mail/sendmail.cf file. If the timer is already set to a large number, look for<br />

hardware problems such as poor network cabling or connections.<br />

See Also<br />

For more information about setting the timer, see the section describing the<br />

sendmail(1M) configuration options in the Mail Administration Guide. If you are<br />

using the AnswerBook, the term "timeouts" is a good search <strong>string</strong>.<br />

550 hostname... Host unknown<br />

Cause<br />

This sendmail(1M) message indicates that the destination host machine, specified<br />

by the address portion after the @ (at-sign), was not found during DNS (Domain<br />

Naming System) lookup.<br />

Action<br />

Use the nslookup(1M) command to verify that the destination host exists in that or<br />

other domains, perhaps with a slightly different spelling. Failing that, contact the<br />

intended recipient and ask for a proper address.<br />

Sometimes this return message indicates that the intended host is down, rather than<br />

unknown. If a DNS record contains an unknown alternate host, and the primary host<br />

is down, sendmail(1M) returns a "Host unknown" message from the alternate host.¤<br />

12 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


For uucp(1C) mail addresses, the "Host unknown" message probably means that the<br />

destination hostname is not listed in the /etc/uucp/Systems file.<br />

Technical Notes<br />

¤ This is a known sendmail(1M) version 8.6.7 bug.<br />

See Also<br />

For information on how sendmail(1M) works, see the Mail Administration Guide.<br />

550 username... User unknown<br />

Cause<br />

This sendmail(1M) message indicates that the intended recipient, specified by the<br />

address portion before the @ (at-sign), could not be located on the destination host<br />

machine.<br />

Action<br />

Check the e-mail address and try again, perhaps with a slightly different spelling. If<br />

this doesn’t work, contact the intended recipient and ask for a proper address.<br />

See Also<br />

For information on how sendmail(1M) works, see the Mail Administration Guide.<br />

554 hostname<br />

... Local configuration error<br />

Cause<br />

This sendmail(1M) message usually indicates that the local host is trying to send<br />

mail to itself.<br />

Alphabetical Message Listing 13


Action<br />

Check the value of the $j macro in the /etc/mail/sendmail.cf file to ensure<br />

that this value is a fully-qualified domain name.<br />

Technical Notes<br />

When the sending system provides its hostname to the receiving system (in the<br />

SMTP HELO command), the receiving system compares its name to the sender’s name.<br />

If these are the same, the receiving system issues this error message and closes the<br />

connection. The name provided in the HELO command is the value of the $j macro.<br />

See Also<br />

For information on how sendmail(1M) works, see the Mail Administration Guide.<br />

"A"<br />

Accessing a corrupted shared library<br />

Cause<br />

Trying to exec(2) an a.out that requires a static shared library (to be linked in) and<br />

exec(2) could not load the static shared library. The static shared library is probably<br />

corrupted.<br />

Technical Notes<br />

The symbolic name for this error is ELIBBAD, errno=84.<br />

14 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


access violation unknown host<br />

IP address<br />

Cause<br />

Solstice Backup Utility fails displaying the error:<br />

access violation unknown host IP address on Networker 4.2.2. This is<br />

usually caused by corrupted host name in the host NIS/NIS+ map/table.<br />

Action<br />

Check the Networker client configuration for an incorrect host name. If all else fail, a<br />

workaround is to add the entry to /etc/hosts.<br />

A command window has exited because<br />

its child exited.<br />

Cause<br />

The argument to a cmdtool(1) or a shelltool(1) window looks like it is supposed to be a<br />

command, but the system cannot find the command.<br />

Action<br />

To run this command inside a cmdtool(1) or a shelltool(1), make sure the command is<br />

spelled correctly and is in your search path (if necessary, use a full path name). If<br />

you intended this argument as an option setting, use a minus sign (-) at the<br />

beginning of the option.<br />

Technical Notes<br />

Both the cmdtool(1) and the shelltool(1) are OpenWindows terminal emulators.<br />

Alphabetical Message Listing 15


Address already in use<br />

Cause<br />

User attempted to use an address already in use, and the protocol does not allow this.<br />

Technical Notes<br />

The symbolic name for this error is EADDRINUSE, errno=125.<br />

Address family not supported by<br />

protocol family<br />

Cause<br />

An address incompatible with the requested protocol was used.<br />

Technical Notes<br />

The symbolic name for this error is EAFNOSUPPORT, errno=124.<br />

admintool: Received communication<br />

service error 4<br />

Cause<br />

AdminTool could not start a display method because a remote procedure call timed<br />

out, so it can’t send the request. This error results when admintool(1M) tries to<br />

access the NIS or NIS+ tables when networking is not enabled.<br />

Action<br />

Verify the system network status with ifconfig -a to make sure the system is<br />

connected to the network. Make sure the ethernet cable is connected and the system<br />

is configured to run NIS or NIS+.<br />

16 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Advertise error<br />

Cause<br />

This error is RFS specific. It occurs when users try to advertise a resource already<br />

advertised, try to stop RFS while there are resources still advertised, or try to force<br />

unmount a resource when it is still advertised.<br />

Technical Notes<br />

The symbolic name for this error is EADV, errno=68.<br />

answerbook: XView error: NULL pointer<br />

passed to xv_set<br />

Cause<br />

The AnswerBook navigator window comes up, but the document viewer window<br />

does not. This message appears on the console, and the message "Could not start<br />

new viewer" appears in the navigator window. This situation indicates that you have<br />

an unknown client or a problem with the network naming service.<br />

Action<br />

Run the ypmatch(1) or nismatch(1) command o determine if the client hostname<br />

is in the hosts map. If it isn’t, add it to to NIS hosts map on the NIS master server.<br />

Then make sure the /etc/hosts file on the client contains an IP address and entry<br />

for that hostname followed by loghost (reboot if you changed the /etc/hosts<br />

file). Check that the ypmatch(1) or nismatch(1) client hosts command returns the<br />

same IP host address as in the /etc/hosts file. Finally, quit all existing<br />

AnswerBooks and restart.<br />

See Also<br />

For more information on the NIS hosts map, see the section on the default search<br />

criteria in the NIS+ and FNS Administration Guide. If you are using the AnswerBook,<br />

"NIS hosts map" is a good search <strong>string</strong>.<br />

Alphabetical Message Listing 17


Arg list too long<br />

Cause<br />

The system could not handle the number of arguments given to a command or<br />

program when it combined those arguments with the environment’s exported shell<br />

variables. The argument list limit is the size of the argument list plus the size of the<br />

environment’s exported shell variables.<br />

Action<br />

The easiest solution is to reduce the size of the parent process environment by<br />

unsetting extraneous environment variables. (See the man page for the shell you’re<br />

using to find out how to list and change your environment variables.) Then run the<br />

program again.<br />

Technical Notes<br />

An argument list longer than ARG_MAX bytes was presented to a member of the<br />

exec(2) family of system calls.<br />

The symbolic name for this error is E2BIG, errno=7.<br />

Argument out of domain<br />

Cause<br />

This is a programming error or a data input error.<br />

Action<br />

Ask the program’s author to fix this condition, or supply data in a different format.<br />

Technical Notes<br />

This indicates an attempt to evaluate a mathematical programming function at a<br />

point where its value is not defined. The argument of a programming function in the<br />

math package (3M) is out of the domain of the function. This could happen when<br />

taking the square root, power, or log of a negative number, when computing a power<br />

18 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


to a non-integer, or when passing an out-of-range argument to a hyperbolic<br />

programming function.<br />

To help pinpoint a program’s math errors, use the matherr(3M) facility.<br />

The symbolic name for this error is EDOM, errno=33.<br />

Arguments too long<br />

Cause<br />

This C shell error message indicates that there are too many arguments after a<br />

command. For example, this can happen by invoking rm * in a huge directory. The<br />

C shell cannot handle more than 1706 arguments.<br />

Action<br />

Temporarily start a Bourne shell with sh(1) and run the command again. The<br />

Bourne shell dynamically allocates command line arguments. Return to your original<br />

shell by typing exit.<br />

assertion failed: <strong>string</strong>, file name,<br />

line int<br />

Cause<br />

An unexoected condition in the program has happened.<br />

Action<br />

Contact the vendor or author of the program to ask why it failed. If you have the<br />

source code for the program, you can look at the file and line number where the<br />

assertion failed. This might give you an idea of how to run the program differently.<br />

Technical Notes<br />

This message results from a diagnostic macro called assert(3C) that a programmer<br />

inserted into the specified line of a source file. The expression that evaluated untrue<br />

precedes the file name and line number.<br />

Alphabetical Message Listing 19


Attempting to link in more shared<br />

libraries than system limit<br />

Cause<br />

Trying to exec(2) an a.out that requires more static shared libraries than is<br />

allowed on the current configuration of the system.<br />

Technical Notes<br />

The symbolic name for this error is ELIBMAX, errno=86.<br />

automountd[int]: server hostname<br />

responding<br />

Cause<br />

This automounter message indicates that the system tried to mount a filesystem from<br />

an NFS server that is either down or extremely slow to respond. In some cases this<br />

message indicates that the network link to the NFS server is broken, although that<br />

condition produces other error messages as well.<br />

Action<br />

If you are the system administrator responsible for the non-responding NFS server,<br />

check to see whether the machine needs repair or rebooting. Encourage your user<br />

community to report such problems quickly but only once. When the NFS server is<br />

back in operation, the automounter will be able to access the requested filesystem.<br />

See Also<br />

For more information on NFS failures, see the section on NFS troubleshooting in the<br />

NFS Administration Guide. If you are using the AnswerBook, a good search <strong>string</strong> is<br />

"NFS Service."<br />

20 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


automount[int]: name: Not a directory<br />

Cause<br />

The file specified after the first colon is not a valid mount point because it is not a<br />

directory.<br />

Action<br />

Ensure that the mount point is a directory and not a regular file or a symbolic link.<br />

"B"<br />

Bad address<br />

Cause<br />

The system encountered a hardware fault in attempting to access a parameter of a<br />

programming function.<br />

Action<br />

Check the address to see if it resulted from supplying the wrong device or option to<br />

a command. If that is not the problem, contact the vendor or author of the program<br />

for an update.<br />

Technical Notes<br />

This error could occur any time a function that takes a pointer argument is passed an<br />

invalid address. Because processors differ in their ability to detect bad addresses, on<br />

some architectures passing bad addresses can result in undefined behaviors.<br />

The symbolic name for this error is EFAULT, errno=14.<br />

Alphabetical Message Listing 21


BAD/DUP FILE I=i OWNER=o MODE=m SIZE=s<br />

MTIME=t CLEAR?<br />

Cause<br />

While checking inode link counts during phase 4, fsck(1M) found a file (or<br />

directory) that either does not exist or exists somewhere else.<br />

Action<br />

To clear the inode of its reference to this file or directory, answer yes. With the -p<br />

(preen) option, fsck(1M) automatically clears bad or duplicate file references, so<br />

answering yes to this question seldom causes a problem.<br />

Bad file number<br />

Cause<br />

Generally this is a program error, not a usage error.<br />

Action<br />

Contact the vendor or author of the program for an update.<br />

Technical Notes<br />

Either a file descriptor refers to no open file, or a read(2) (or write(2)) request is<br />

made to a file that is open only for writing (or reading).<br />

The symbolic name for this error is EBADF, errno=9.<br />

block no. BAD I=inode no.<br />

Cause<br />

Upon detecting an out-of-range block, fsck(1M) prints the bad block number and<br />

its containing inode (after I=).<br />

22 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

In fsck(1M) phases 2 and 4, you decide whether or not to clear these bad blocks.<br />

Before committing to repair with fsck(1M), you could determine which file<br />

contains this inode by passing the inode number to the ncheck(1M) command:<br />

# ncheck -iinum filesystem<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

BAD_MESSAGE (error code 100) from<br />

X.400<br />

Cause<br />

X.400 software was working fine. Suddenly message exchanges failed in<br />

ma_start_delivery(). It was returning an error code of 100 (BAD_MESSAGE).<br />

ma_start_delivery() call fails when trying to exhange file of more than 900<br />

bytes.<br />

Action<br />

X.400 was restarted with the wrong umask.<br />

et the umask to 0022 and restart software to fix.<br />

bad module/chip at: position<br />

Cause<br />

This message from the memory management system often appears with parity<br />

errors, and indicates a bad memory module or chip at the position listed. Data loss is<br />

possible if the problem occurs other than at boot time.<br />

Alphabetical Message Listing 23


Action<br />

Replace the memory module or chip at the indicated position. Refer to the vendor’s<br />

hardware manual for help finding this location.<br />

Bad request descriptor<br />

Cause<br />

Apparently used only in NIS+ to indicate corrupted or missing tables.<br />

Technical Notes<br />

The symbolic name for this error is EBADR, errno=51.<br />

BAD SUPER BLOCK: <strong>string</strong><br />

Cause<br />

This message from fsck(1M) indicates that a filesystem’s super-block is damaged<br />

beyond repair and must be replaced. At boot time (with the -p option) this message<br />

is prefaced by the filesystem’s device name. After this message comes the actual<br />

damage recognized (see Action). Unfortunately fsck(1M) does not print the<br />

number of the damaged super-block.<br />

Action<br />

The most common cause of this error is overlapping disk partitions. Do not<br />

immediately rerun fsck(1M) as suggested by the lines that display after the error<br />

message. First make sure that you have a recent backup of the filesystem involved; if<br />

not, try to back up the filesystem now using ufsdump(1M). Then run the<br />

format(1M) command, select the disk involved, and print out the partition<br />

information.<br />

# format<br />

: N<br />

> partition<br />

(continued)<br />

24 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


(Continuation)<br />

> print<br />

Note whether the overlap occurs at the beginning or end of the filesystem involved.<br />

Then run newfs(1M) with the -N option to print out the filesystem parameters,<br />

including the location of backup super-blocks.<br />

# newfs -N /dev/dsk/device<br />

Select a super-block from a non-overlapping area of the disk, but note that in most<br />

cases you have only one chance to select the proper replacement super-block, which<br />

fsck(1M) soon propagates to all the cylinders. If you select the wrong replacement<br />

super-block, data corruption will probably occur, and you will have to restore from<br />

backup tapes. After you select a new super-block, provide fsck(1M) with the new<br />

master super-block number:<br />

# fsck -o b=NNNN /dev/dsk/device<br />

Technical Notes<br />

Specific reasons for a damaged super-block include: a wrong magic number, out of<br />

range NCG (number of cylinder groups) or CPG (cylinders per group), the wrong<br />

number of cylinders, a preposterously large super-block size, and trashed values in<br />

super-block. These reasons are generally not meaningful because a corrupt<br />

super-block is usually extremely corrupt.<br />

See Also<br />

For more information on bad superblocks, see the sections on restoring bad<br />

superblocks in the System Administration Guide, Volume I. If you are using the<br />

AnswerBook, "superblock" is a good search <strong>string</strong>.<br />

Alphabetical Message Listing 25


BAD TRAP<br />

Cause<br />

A bad trap can indicate faulty hardware or a mismatch between hardware and its<br />

configuration information. Data loss is possible if the problem occurs other than at<br />

boot time.<br />

Action<br />

If you recently installed new hardware, verify that the software was correctly<br />

configured. Check the kernel traceback displayed on the console to see which device<br />

generated the trap. If the configuration files are correct, you will probably have to<br />

replace the device.<br />

In some cases, the bad trap message indicates a bad or down-rev CPU.<br />

Technical Notes<br />

A hardware processor trap occurred, and the kernel trap handler was unable to<br />

restore system state. This is a fatal error that usually precedes a panic, after which<br />

the system performs a sync, dump, and reboot. The following conditions can cause a<br />

bad trap: a system text or data access fault, a system data alignment error, or certain<br />

kinds of user software traps.<br />

/bin/sh: file: too big<br />

Cause<br />

This Bourne shell message indicates a classic "no memory" error. While trying to load<br />

the program specified after the first colon, the shell noticed that the system ran out<br />

of virtual memory (swap space).<br />

Action<br />

See the message "Not enough space" for information on reconfiguring your system to<br />

add more swap space.<br />

26 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Block device required<br />

Cause<br />

A raw (character special) device was specified where a block device was required,<br />

such as during a call to the mount(1M) command.<br />

Action<br />

To see which block devices are available, use ls -l to look in /devices. Then<br />

specify a block device instead of a character device. Block device modes start with a<br />

b, whereas raw character device modes start with a c.<br />

Technical Notes<br />

The symbolic name of this error is ENOTBLK, errno=15.<br />

Boot device: /iommu/sbus/<br />

directory/directoryliteral>/sd@3,0<br />

Cause<br />

This message always appears at the beginning of rebooting. If there is a problem, the<br />

system hangs, and no other messages appear. This condition is caused by conflicting<br />

SCSI targets for the boot device, which is almost always target 3.<br />

Action<br />

The boot device is usually the machine’s internal disk drive, target 3. Make sure that<br />

external and secondary disk drives are targeted to 1, 2, or 0, and do not conflict with<br />

each other. Also make sure that tape drives are targeted to 4 or 5, and CD drives to<br />

6, avoiding any conflict with each other or with the disk drives. You can set a<br />

device’s target number using pushbutton switches or a dial on the back near the<br />

SCSI cables. If the targeting of the internal disk drive is in question, check it by<br />

powering off the machine, removing all external drives, turning the power on, and<br />

running the probe-scsi-all or probe-scsi command from the PROM monitor.<br />

Alphabetical Message Listing 27


oot error 13 | sun4m<br />

Cause<br />

Diskless client boot gave NFS mount error 13<br />

Cause<br />

Diskless client fails to boot, NFS error 13<br />

Broadcast Message from root (pts/int)<br />

on server [date]<br />

Cause<br />

This message from the wall(1M) command is transmitted to all users logged into a<br />

system. You could see it during a rlogin(1) or telnet(1) session, or on terminals<br />

connected to a timesharing system.<br />

Action<br />

Carefully read the broadcast message. Often this broadcast is followed by a<br />

shutdown warning.<br />

See the message "The system will be shut down in int minutes" for details about<br />

system shutdown.<br />

See Also<br />

For more information on bringing down the system, see the section on halting the<br />

system in the System Administration Guide, Volume I. If you are using the<br />

AnswerBook, "halting the system" is a good search <strong>string</strong>.<br />

28 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Broken pipe<br />

Cause<br />

This condition is often normal, and the message is merely informational (as when<br />

piping many lines to the head(1) program). The condition occurs when a write on a<br />

pipe does not find a reading process. This usually generates a signal to the executing<br />

program, but this message displays when the program ignores the signal.<br />

Action<br />

Check the process at the end of the pipe to see why it exited.<br />

Technical Notes<br />

The symbolic name of this error is EPIPE, errno=32.<br />

Bus Error<br />

Cause<br />

A process has received a signal indicating that it attempted to perform I/O to a<br />

device that is restricted or that does not exist. This message is usually accompanied<br />

by a core dump, except on read-only filesystems.<br />

Action<br />

Use a debugger to examine the core file and determine what program fault or system<br />

problem led to the bus error. If possible, check the program’s output files for data<br />

corruption that might have occurred before the bus error.<br />

Technical Notes<br />

Bus errors can result from either a programming error or device corruption on your<br />

system. Some common causes of bus errors are: invalid file descriptors, unreasonable<br />

I/O requests, bad memory allocation, misaligned data structures, compiler bugs, and<br />

corrupt boot blocks.<br />

Alphabetical Message Listing 29


"C"<br />

Cannot access a needed shared library<br />

Cause<br />

Trying to exec(2) an a.out that requires a static shared library and the static<br />

shared library doesn’t exist or the user doesn’t have permission to use it.<br />

Technical Notes<br />

The symbolic name for this error is ELIBACC, errno=83.<br />

Cannot allocate colormap entry for "<br />

<strong>string</strong> "<br />

Cause<br />

This message from libXt (X Intrinsics library) indicates that the system colormap<br />

was full even before the color name specified in quotes was requested. Some<br />

applications can continue after this message. Other applications, such as Workspace<br />

Properties Color, fail to come up when the colormap is full.<br />

Action<br />

Exit the programs that make heavy use of the colormap, then restart the failed<br />

application and try again.<br />

Cannot assign requested address<br />

Cause<br />

Results from an attempt to create a transport endpoint with an address not on the<br />

current machine.<br />

30 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

The symbolic name for this error is EADDRNOTAVAIL, errno=126.<br />

cannot change passwd, not correct<br />

passwd<br />

Cause<br />

While running yppasswd(1) and trying to change a user’s password, it responded<br />

with the message cannot change passwd, not correct passwd.<br />

Also getting yppasswd user <strong>string</strong> does not exist on server console, but<br />

running ypcat passwd | grep (user) returns the username. Verified that<br />

yppasswdd(1M) is running.<br />

Action<br />

Check the passwd(4) file with pwck(1M) and verify that yppasswdd(1M) is<br />

running on the right server. Then verify where the passwd(4) file is located and if<br />

changed, check that yppasswdd(1M) has the location in the process line. The<br />

password located in /etc/yp should read<br />

/usr/lib/yp/rpc.yppasswdd -D /etc/yp. The -D option with the passwd files<br />

directory location tells yppasswdd(1M) where to update and verify password<br />

changes.<br />

Cannot exec a shared library directly<br />

Cause<br />

Attempting to exec(2) a shared library directly.<br />

Technical Notes<br />

The symbolic name for this error is ELIBEXEC, errno=87.<br />

Alphabetical Message Listing 31


Cannot find SERVER hostname in<br />

network database<br />

Cause<br />

A brief description: the user is on a different subnet and is running permanent<br />

licenses:<br />

ultra1(50)% cc -o hello hello.c<br />

License Error : Cannot find the license server (fry)<br />

in the network database for product(Sun WorkShop Compiler C)<br />

Cannot find SERVER hostname in network database (-14,7)<br />

cc: acomp failed for hello.c<br />

ultra1(51)%<br />

Action<br />

First, make sure that the server is up and running. Second, make sure that the server<br />

is in the /etc/hosts file of the client system by typing: ping server name. Third,<br />

check the license daemon on teh server to see if it is running. Fourth, make sure that<br />

there is an elementary license file on the client:<br />

cd /etc/opt/licenses<br />

more sunpro.loc<br />

Fifth, in the sunpro.loc directory are there only text license files such as<br />

sunpro.lic,1? Sixth, on the client check:<br />

% cd /etc<br />

% more nsswitch.conf | grep hosts<br />

hosts: nis [NOTFOUND=return] files<br />

This says that it will use the nis server to look up the IP address. If it is set first for<br />

nis and the /etc/hosts file has the server listed by name, change the line to<br />

hosts:<br />

files nis<br />

Then see if it can find it. If not, try truss and snoop to see what is going on.<br />

32 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Cannot open FCC file<br />

Cause<br />

When trying to send mail via Netscape, the message is displayed. The problem is<br />

that Netscape is trying to save the outbound message to a file that has been specified<br />

by the user, but does not exist.<br />

Action<br />

To correct this problem: go to options Mail and News Preferences, then go to<br />

Compose. A template pops up. There is a section that specifies where to save<br />

out-going mail and news files. Make sure that these files exist or remove them from<br />

the template, if you do not care about logging what messages are sent via Netscape.<br />

Cannot send after transport endpoint<br />

shutdown<br />

Cause<br />

A request to send data was disallowed because the transport endpoint has already<br />

been shut down.<br />

Technical Notes<br />

The symbolic name for this error is ESHUTDOWN, errno=143.<br />

Can’t create public message device<br />

(Device busy)<br />

Cause<br />

This message comes from the lp(1) print scheduler, indicating that it is either<br />

extremely busy or hung.<br />

Alphabetical Message Listing 33


Action<br />

If print jobs are coming out of the printer in question, wait until they are finished<br />

and then resubmit this print job. If you see this message again, the lp(1) system is<br />

probably hung.<br />

See the message "lp hang" for a procedure to clear the queue.<br />

Technical Notes<br />

If lp(1) is unable to create a device for printer messages, the message FIFO could be<br />

already in use, or locked by another print job.<br />

See Also<br />

For more information on the print scheduler, see the section on administrating<br />

printers in the System Administration Guide Volume II.<br />

Can’t invoke /etc/init, error int<br />

Cause<br />

This message can appear while a system is booting, indicating that the init(1M)<br />

program is missing or corrupted. Note that /etc/init is a symbolic link to<br />

/sbin/init.<br />

Action<br />

Boot the miniroot so you can replace init(1M). Halt the machine by typing<br />

Stop-A or by pressing the reset button. Reboot single-user from CD-ROM, the net,<br />

or diskette. For example, type boot cdrom -s at the ok prompt to boot from<br />

CD-ROM. After the system comes up and gives you a # prompt, mount the device<br />

corresponding to the original / partition somewhere, with a command similar to the<br />

mount(1M) command below. Then copy the init(1M) program from the miniroot<br />

to the original / partition, and reboot the system.<br />

# mount /dev/dsk/c0t3d0s0 /mnt<br />

# cp /sbin/init /mnt/sbin/init<br />

# reboot<br />

34 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


If this doesn’t work, other files might be corrupted, and you might need to reinstall<br />

the entire system.<br />

Technical Notes<br />

The error number is 2 if /sbin/init is missing, or 8 if /sbin/init has an<br />

incorrect executable format. This is usually followed by a "panic: icode" message. The<br />

system tries to reboot itself, but goes into a loop, because rebooting is impossible<br />

without init(1M).<br />

See Also<br />

For more information on booting the system, see the section on halting and booting<br />

the system in the System Administration Guide, Volume I.<br />

can’t open /dev/rdsk/<strong>string</strong>: (null):<br />

UNEXPECTED INCONSISTENCY<br />

Cause<br />

On SunOS 4.1.x, this message indicated that the device containing the /dev<br />

filesystem has become disconnected. Solaris behavior has not been confirmed.<br />

can’t synchronize with hayes<br />

Cause<br />

This message sometimes appears when using a modem that the system regards as a<br />

"Hayes" type modem, which includes most modems manufactured today. The<br />

message can be caused by incorrect switch settings, by poor cable connections, or by<br />

not turning the modem on.<br />

Action<br />

Check that the modem is on and that the cables between the modem and your<br />

system are securely connected. Check the internal and external modem switch<br />

settings. Turn the modem off and then on again, if necessary.<br />

Alphabetical Message Listing 35


cd: Too many arguments<br />

Cause<br />

The C shell’s cd(1) command takes only one argument. Either more than one<br />

directory was specified, or a directory name containing a space was specified.<br />

Directory names with spaces are easy to create with File Manager.<br />

Action<br />

Use only one directory name. To change to a directory whose name contains spaces,<br />

enclose the directory name in double (") or single (’) quotes, or use File Manager.<br />

Channel number out of range<br />

Cause<br />

The system has run out of stream devices. This error results when a stream head<br />

attempts to open a minor device that does not exist or that is currently in use.<br />

Action<br />

Check that the stream device in question exists and was created with an appropriate<br />

number of minor devices. Make sure that the hardware corresponds to this<br />

configuration. If the stream device configuration is correct, try again later when more<br />

system resources might be available.<br />

Technical Notes<br />

The symbolic name for this error is ECHRNG, errno=37.<br />

chmod: ERROR: invalid mode<br />

Cause<br />

This message from the chmod(1) command indicates a problem in the first<br />

non-option argument.<br />

36 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

If you are specifying a numeric file mode, you can provide any number of digits<br />

(although only the final one to four are considered), but all digits must be between 0<br />

and 7. If you are specifying a symbolic file mode, use the syntax provided in the<br />

chmod(1) usage message to avoid the "invalid mode" error message:<br />

Usage: chmod [ugoa][+-=][rwxlstugo] file ...<br />

Some combinations of symbolic keyletters produce no error message but fail to have<br />

any effect. The first group, [ugoa], is truly optional. The second group, [+-=], is<br />

mandatory for chmod(1) to have an effect. The third group, [rwxlstugo], is also<br />

mandatory for effect, and can be used in combination when that combination does<br />

not conflict.<br />

Command not found<br />

Cause<br />

The C shell could not find the program you gave as a command.<br />

Action<br />

Check the form and spelling of the command line. If that looks correct, echo $path<br />

to see if the user’s search path is correct. When communications are garbled, it is<br />

possible to unset a search path to such an extent that only built-in shell commands<br />

are available. Here is a command to reset a basic search path:<br />

% set path = (/usr/bin /usr/ccs/bin /usr/openwin/bin .)<br />

If the search path looks correct, check the directory contents along the search path to<br />

see if programs are missing or if directories are not mounted.<br />

See Also<br />

For more information about the C shell, see csh(1).<br />

Alphabetical Message Listing 37


Communication error on send<br />

Cause<br />

This error occurs when the current process is waiting for a message from a remote<br />

machine, but the link connecting the machines breaks.<br />

Technical Notes<br />

The symbolic name for this error is ECOMM, errno=70.<br />

Connection closed.<br />

Cause<br />

This message can appear when using rlogin(1) to another system if the remote<br />

host cannot create a process for this user, if the user takes too long to type the correct<br />

password, if the user interrupts the network connection, or if the remote host goes<br />

down. Data loss is possible if files were modified and not saved before the<br />

connection closed.<br />

Action<br />

Just try again. If the other system has gone down, wait for it to reboot first.<br />

Connection closed by foreign host.<br />

Cause<br />

When a user telnet(1)s to another system, this message can appear if the user<br />

takes too long to type the correct password, if the remote host cannot create a login<br />

for this user, or if the remote host goes down or terminates the connection. Data loss<br />

is possible if files were modified and not saved before the connection closed.<br />

Action<br />

Just try again. If the other system has gone down, wait for it to reboot first.<br />

38 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


[Connection closed. Exiting]<br />

Cause<br />

After using the talk(1) command to communicate with another user, the other<br />

person enters an interrupt (usually Control-c), and this message appears on your<br />

screen.<br />

Action<br />

Sending an interrupt like this is the usual way of exiting the talk program. The<br />

talk(1) session is over and you can return to your work.<br />

Connection refused<br />

Cause<br />

No connection could be made because the target machine actively refused it. This<br />

happens either when trying to connect to an inactive service or when a service<br />

process is not present at the requested address.<br />

Action<br />

Activate the service on the target machine, or start it up again if it has disappeared.<br />

If for security reasons you do not intend to provide this service, inform the user<br />

community, possibly suggesting an alternative.<br />

Technical Notes<br />

The symbolic name for this error is ECONNREFUSED, errno=146.<br />

Connection reset by peer<br />

Cause<br />

A connection was forcibly closed by a peer. This normally results from a loss of the<br />

connection on the remote host due to a timeout or a reboot.<br />

Alphabetical Message Listing 39


Technical Notes<br />

The symbolic name for this error is ECONNRESET, errno=131.<br />

Connection timed out<br />

Cause<br />

This occurs either when the destination host is down or when problems in the<br />

network cause lost transmission.<br />

Action<br />

First check the operation of the host system, for example by using ping(1M) and<br />

ftp(1), then repair or reboot as necessary. If that doesn’t solve the problem, check<br />

the network cabling and connections.<br />

Technical Notes<br />

No connection was established in a specified time. A connect or send request failed<br />

because the destination host did not properly respond after a reasonable interval.<br />

(The timeout period is dependent on the communication protocol.)<br />

The symbolic name for this error is ETIMEDOUT, errno=145.<br />

console login: ^J^M^Q^K^K^P<br />

Cause<br />

This usually occurs because OpenWindows exited abnormally, leaving the system’s<br />

keyboard in the wrong mode. The characters that appear when someone attempts to<br />

login are garbage transliterations of what someone types.<br />

Action<br />

On a SPARC system: find another machine and remote login to this system, then run<br />

this command:<br />

$ /usr/openwin/bin/kbd_mode -a<br />

40 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


This puts the console back into ASCII mode. Note that kbd_mode is not a windows<br />

program, it just fixes the console mode.<br />

On an x86 system: log in remotely and start, then kill the X server, or reboot the<br />

system.<br />

Technical Notes<br />

The usual reason for this problem occurring is an automated script run from<br />

cron(1M) that clears out the /tmp directory every so often. Ensure that any such<br />

scripts do not remove the /tmp/.X11-pipe or /tmp/.X11-unix directories, or<br />

any files in them.<br />

core dumped<br />

Cause<br />

A core(4) file contains an image of memory at the time of software failure, and is<br />

used by programmers to find the reason for the failure.<br />

Action<br />

To see which program produced a core(4) file, run either the file(1) command or<br />

the adb(1)(1) command. The following examples show the output of the file(1)<br />

and adb(1) commands on a core file from the dtmail program.<br />

$ file core<br />

core: ELF 32-bit MSB core file SPARC Version 1, from ‘dtmail’<br />

$ adb core<br />

core file = core -- program ‘dtmail’<br />

SIGSEGV 11: segmentation violation<br />

^D<br />

(use Control-d to quit the program)<br />

Ask the vendor or author of this program for a debugged version.<br />

Alphabetical Message Listing 41


Technical Notes<br />

Some signals, such as SIGQUIT, SIGBUS, and SIGSEGV, produce a core dump. See<br />

the signal(5) man page for a complete list.<br />

If you have the source code for the program, you can try compiling it with cc -g,<br />

and debugging it yourself using dbx or a similar debugger. The where directive of<br />

dbx provides a stack trace.<br />

On mixed networks, it can be difficult to discern which machine architecture<br />

produced a particular core dump, since adb(1) on one type of system generally<br />

cannot read a core(4) file from another type of system, and will produce an<br />

"unrecognized file" message. Run adb(1) on various machine architectures until you<br />

find the right one.<br />

The term "core" is archaic– ferrite core memory was supplanted by silicon RAM in<br />

the 1970s, although spaceships still employ core memory for its imperviousness to<br />

radiation.<br />

See Also<br />

For information on saving and viewing crash information see the System<br />

Administration Guide, Volume II. If you are using the AnswerBook, "system crash" is a<br />

good search <strong>string</strong>.<br />

corrupt label - wrong magic number or<br />

corrupt label - label checksum failed<br />

Cause<br />

After a power cycle, the machine comes up with error messages saying:<br />

corrupt label - label checksum failed or<br />

corrupt label - wrong magic number . format(1M) showed:<br />

0 unassigned wm 0 0 (0/0/0) 0<br />

1 unassigned wm 0 0 (0/0/0) 0<br />

2 backup wm 0 - 5460 4.2G (5460/0/0) 4154160<br />

3 unassigned wm 0 0 (0/0/0) 0<br />

4 unassigned wm 0 0 (0/0/0) 0<br />

5 unassigned wm 0 0 (0/0/0) 0<br />

6 unassigned wm 0 - 2730 2.1G (0/0/0) 0<br />

7 unassigned wm 2730-5460 2.1G (0/0/0) 0<br />

The disks were using raw partitions beginning at block 0 (cylinder 0).<br />

42 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


The disk label (VTOC) is kept on the block 0 of cylinder 0. The label will eventually<br />

get overwritten by database programs using raw partitions if the raw partition<br />

begins at cylinder 0. (Unix filesystems avoid this area of the partition.)<br />

Action<br />

The workaround is to go into format and get the backup label using the backup<br />

command. Relabel the disk using this backup label. You should then be able to<br />

access the disk.<br />

Backup the data on this disk.<br />

Go back to the disk and relabel it, starting the raw partition at cylinder 1 (This looses<br />

one cylinder, but prevents corrupting the VTOC).<br />

Label again.<br />

Restore the data from your backup.<br />

could not grant slave pty<br />

Cause<br />

User gets the error message: could not grant slave pty when attempting a<br />

telnet(1), rlogin(1), or rsh(1) session (anything that requires a shell) or when<br />

trying to bring up an x-term.<br />

Action<br />

The user’s file permissions were set wrong on /usr/lib/pt_chmod. The user had:<br />

# ls -la /usr/lib/pt_chmod<br />

---s--x--x 1 bin bin 3120 May 3 1996<br />

The permissions should be:<br />

# ls -la /usr/lib/pt_chmod<br />

---s--x--x 1 root bin 3120 May 3 1996<br />

Note that the owner should be root, user had bin as the owner. Also note that the<br />

setuid bit must be set. Once the user did a chown root pt_chmod, everything<br />

worked again.<br />

Alphabetical Message Listing 43


Could not initialize tooltalk (tt_open):<br />

TT_ERR_NOMP<br />

Cause<br />

Various desktop tools display or print this message when the ttsession(1) process is<br />

not available. The TookTalk service generally tries to restart ttsession(1) if it is not<br />

running. So this error indicates that the ToolTalk service is either not installed or is<br />

not installed correctly.<br />

Action<br />

Verify that the ttsession(1) command exists in /usr/openwin/bin or /usr/dt/bin.<br />

If this command is not present, ToolTalk is not installed correctly. The packages<br />

constituting ToolTalk are the runtime SUNWtltk, developer support SUNWtltkd,<br />

and the manual pages SUNWtltkm. CDE ToolTalk packages have the same names<br />

with ".2" appended.<br />

Technical Notes<br />

The full TT_ERR_NOMP message <strong>string</strong> reads as follows: "No ttsession(1) is running,<br />

probably because tt_open(3) has not been called yet. If this is returned from tt_open(3)<br />

it means ttsession(1) could not be started, which generally means ToolTalk is not<br />

installed on the system."<br />

Could not open ToolTalk Channel<br />

Cause<br />

Attempting to run workshop remotely, the error message is displayed.<br />

The fix is the following: 1. Make sure workshop is no longer running; 2. In the<br />

telnet/rlogin session window, type /bin/ps -ef | grep ttsession. If<br />

there is one running that belongs to the user who has telneted into the system,<br />

type kill pid_of_ttsession; 3. In the telnet rlogin session, type<br />

/usr/dt/bin/ttsession -s -d :0.0; 4. Start<br />

workshop.<br />

44 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Could not start new viewer<br />

Cause<br />

This message appears in the AnswerBook navigator window, along with an XView<br />

error message on the console.<br />

Action<br />

See the message "answerbook: XView error: NULL pointer passed to xv_set" for<br />

details.<br />

cpio: Bad magic number/header.<br />

Cause<br />

A cpio(1) archive has either become corrupted or was written out with an<br />

incompatible version of cpio(1).<br />

Action<br />

Use the -k option to cpio(1) to skip I/O errors and corrupted file headers. This<br />

might permit you to extract other files from the cpio(1) archive. To extract files with<br />

corrupted headers, try editing the archive with a binary editor such as emacs(1). Each<br />

cpio(1) file header contains a filename as a <strong>string</strong>.<br />

See Also<br />

For more information on magic numbers, see magic(4).<br />

Alphabetical Message Listing 45


cpio : can’t read input : end of file<br />

encountered prior to expected end of<br />

archive.<br />

Cause<br />

When we try to read a multivolume floppy in bar format using the following<br />

command:<br />

# cpio -id -H bar -I /dev/diskette0<br />

It fails with the message.<br />

Action<br />

Kill /usr/sbin/vold by running /etc/init.d/volmgt stop and use the device<br />

name /dev/rfd0<br />

Cross-device link<br />

Cause<br />

An attempt was made to make a hard link to a file on another device, such as on<br />

another filesystem.<br />

Action<br />

Establish a symbolic link using ln -s instead. Symbolic links are permitted across<br />

filesystem boundaries.<br />

Technical Notes<br />

The symbolic name for this error is EXDEV, errno=18.<br />

46 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"D"<br />

data access exception<br />

Cause<br />

This message can result from running an old version of the operating system that<br />

does not support new hardware, or by running an operating system that is not<br />

configured for new hardware. It can also result from incorrectly installed DSIMMs or<br />

from a disk problem.<br />

Action<br />

Upgrade your operating system to a version that supports the new hardware or<br />

machine architecture. For example, upgrading a SPARCstation 2 (with sun4c kernel<br />

architecture) to a SPARCstation 20 (with sun4m kernel architecture) requires an<br />

operating system upgrade or reconfiguration.<br />

See Also<br />

For more information on upgrades, see the section describing system and device<br />

configuration in the Solaris 1.x to Solaris 2.x Transition Guide.<br />

Data fault<br />

Cause<br />

This is a kind of bad trap that usually causes a system panic. When this message<br />

appears after a bad trap message, a system text or data access fault probably<br />

occurred.¤ In the absence of a bad trap message, this message might indicate a user<br />

text or data access fault. Data loss is possible if the problem occurs other than at boot<br />

time.<br />

Action<br />

Make sure the machine can reboot, then check the log file /var/adm/messages for<br />

hints about what went wrong.<br />

Alphabetical Message Listing 47


¤ See the message "BAD TRAP" for more information.<br />

Deadlock situation detected/avoided<br />

Cause<br />

A programming deadlock situation was detected and avoided.<br />

Action<br />

If the system had not detected and avoided a deadlock, a piece of software would<br />

have hung. Run the program again. The deadlock might not reoccur.<br />

Technical Notes<br />

This error usually relates to file and record locking, but can also apply to mutexes,<br />

semaphores, condition variables, and read/write locks.<br />

The symbolic name for this error is EDEADLK, errno=45.<br />

See Also<br />

See the section on deadlock handling in the System Interface Guide. See the section on<br />

avoiding deadlock in the Multithreaded Programming Guide.<br />

Destination address required<br />

Cause<br />

A required address was omitted from an operation on a transport endpoint.<br />

Destination address required.<br />

Technical Notes<br />

The symbolic name for this error is EDESTADDRREQ, errno=96.<br />

48 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


dev/fd/int: /dev/fd/int: cannot open<br />

Cause<br />

setuid and setgid shell scripts refuse to run - they simply return an error message<br />

similar to /dev/fd/3: /dev/fd/3: cannot open. (The number following<br />

/dev/fd/ is not necessarily 3.) The first line of the script properly starts a shell, and<br />

the filesystem containing the script is not mounted with the nosuid option.<br />

Running truss on the shell script reveals that a call to open(2) is failing with error<br />

number 6 (ENXIO):<br />

open("/dev/fd/3", O_RDONLY)<br />

Err#6 ENXIO<br />

Action<br />

Setuid and setgid shell scripts use the file descriptors in /dev/fd. The contents of<br />

/dev/fd are a File Descriptor Filesystem (fdfs) on Solaris 2 and have no connection<br />

with floppy disks!<br />

Ensure that the fdfs is mounted as /dev/fd. The following line should appear in<br />

/etc/vfstab exactly like this (with NO initial comment symbol):<br />

fd - /dev/fd fd - no -<br />

before the machine is next rebooted.<br />

It may be possible to remount /dev/fd without rebooting by running the following<br />

as root:<br />

# mount fd /dev/fd<br />

If this fails the machine must be rebooted after editing /etc/vfstab as detailed<br />

above, before setuid/setgid shell scripts are available.<br />

Some administrators, unaware of what /dev/fd is for, comment out the entry in<br />

/etc/vfstab that mounts the fdfs (File Descriptor filesystem). This may go<br />

unnoticed until an attempt is made to run a setuid or setgid shell script.<br />

Alphabetical Message Listing 49


dev/rdsk/c0t6d0s2: No such file or<br />

directory<br />

Cause<br />

When attempting to eject a CD-ROM on a Ultra 450 system, the eject cdrom<br />

command fails, displaying the error message.<br />

This is because the CD-ROM is on controller 1 not 0. For the eject(1) command,<br />

the cdrom "nickname" equates to /dev/rdsk/c0t6d0s2. On an Ultra 450, the<br />

CD-ROM is /dev/rdsk/c1t6d0s2. So, using cdrom does not work.<br />

Action<br />

Use the following command instead:<br />

# eject cdrom0<br />

If volume manager (/usr/sbin/vold) is not running, you can use:<br />

# eject /dev/rdsk/c1t6d0s2<br />

Note: Make sure that the front panel of the system is unobstructed so that the<br />

CD-ROM tray is not blocked. Otherwise, the eject(1) command appears to hang<br />

since the tray is trying to open but is physically blocked.<br />

Device busy<br />

Cause<br />

An attempt was made to mount a device that was already mounted or to unmount a<br />

device containing an active file (such as an open file, a current directory, a mount<br />

point, or a running program). This message also occurs when trying to enable<br />

accounting that is already enabled.<br />

Action<br />

To unmount a device containing active processes, close all the files under that mount<br />

point, quit any programs started from there, and change directories out of that<br />

hierarchy. Then try to unmount again.<br />

50 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

Mutexes, semaphores, condition variables, and read/write locks set this error<br />

condition to indicate that a lock is held.<br />

The symbolic name for this error is EBUSY, errno=16.<br />

device busy<br />

Cause<br />

You eject cdrom and you receive the message device busy. This could be the<br />

result of a number of problems. Below is a list of things that you can check and do to<br />

allow ejection of the cd from the device.<br />

Action<br />

Ensure that the current directory is not somewhere in the CD:<br />

% cd<br />

%eject cdrom<br />

Step B: As root<br />

# cd /etc/init.d<br />

# ./volmgt stop<br />

# eject cdrom<br />

If this works<br />

# ./volmgt start<br />

If this does not work go to step B.<br />

Step C: As root<br />

# fuser /cdrom<br />

Kill any processes that you feel that you have already terminated. A note of caution:<br />

If this is a nfs-mounted cdrom and there are other users who access this drive, make<br />

sure you know what process you are killing and why.<br />

Alphabetical Message Listing 51


# ./volmgt stop<br />

# ps -ef | grep vold<br />

If vold still exists kill the process.<br />

# eject cdrom<br />

If this does not work:<br />

# cd /vol<br />

Make sure that dev, dsk, rdsk, rmt are in the directory. If not, most possibly your<br />

/vol directory is corrupt and a reboot may be needed for proper rebuild.<br />

Step D: The last three options are: 1) Reboot. 2) If the cd drive is external to the<br />

system, try power cycling the drive and pressing the eject button. 3) If all else fails<br />

and the cdrom is external, on the right hand side of the eject button is a small hole<br />

into which you can insert a small straight device which forces manual ejection of the<br />

caddy.<br />

/dev/rdsk/<strong>string</strong>: CAN’T CHECK FILE<br />

SYSTEM.<br />

Cause<br />

The system cannot automatically clean (preen) this filesystem because it appears to<br />

be set up incorrectly or is having hard disk problems. This message asks that you<br />

run fsck(1M) manually, since data corruption might already have occurred.<br />

Action<br />

Run fsck to clean the filesystem in question. See the message "/dev/rdsk/int:<br />

UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY" for proper procedures.<br />

52 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


dev/rdsk/<strong>string</strong>: UNEXPECTED<br />

INCONSISTENCY; RUN fsck MANUALLY.<br />

Cause<br />

At boot time the /etc/rcS script runs the fsck(1M) command to check the<br />

integrity of filesystems marked "fsck" in /etc/vfstab. If fsck(1M) cannot repair a<br />

filesystem automatically, it interrupts the boot procedure and produces this message.<br />

When fsck(1M) gets into this state, it cannot repair a filesystem without losing one<br />

or more files, so it wants to defer this responsibility to you, the administrator. Data<br />

corruption has probably already occurred.<br />

Action<br />

First run fsck -n on the filesystem, to see how many and what type of problems<br />

exist. Then run fsck(1M) again to repair the filesystem. If you have a recent backup<br />

of the filesystem, you can generally answer "y" to all the fsck(1M) questions. It’s a<br />

good idea to keep a record of all problematic files and inode numbers for later<br />

reference. To run fsck(1M) yourself, specify options as recommended by the boot<br />

script. For example:<br />

# fsck /dev/rdsk/c0t4d0s0<br />

Usually the files lost during fsck(1M) repair are these that were created just before<br />

a crash or power outage, and they cannot be recovered. If you lose important files,<br />

you can recover them from backup tapes.<br />

If you don’t have a backup, ask an expert to run fsck(1M) for you.<br />

See Also<br />

For more information on file checking, see the section on checking filesystem<br />

integrity in the System Administration Guide, Volume I.<br />

Directory not empty<br />

Cause<br />

The directory operation that was attempted, such as directory removal with<br />

rmdir(1), can be performed only on an empty directory.<br />

Alphabetical Message Listing 53


Action<br />

To remove the directory, first remove all the files that it contains. A quick way to<br />

remove a non-empty directory hierarchy is with the rm -r command.<br />

Technical Notes<br />

The symbolic name for this error is ENOTEMPTY, errno=93.<br />

diskN not unique<br />

Cause<br />

During boot, system displays disk0 not unique. The error happens before the<br />

kernel loads.<br />

Action<br />

There are more than 1 devalias entries for disk0. Use devalias at the OK<br />

prompt to see the entries.<br />

To remove the duplicate, run the following command at the OK prompt:<br />

nvunalias disk0<br />

and reset the system.<br />

Disc quota exceeded<br />

Cause<br />

The user’s disk limit has been exceeded on a user filesystem, usually because a file<br />

was just created or enlarged beyond the limit. This almost always refers to a<br />

magnetic disk, and not to an optical disc. Any data created after this condition<br />

occurs will be lost.<br />

Action<br />

The user can delete files to bring disk usage under the limit, or the server<br />

administrator can use the edquota(1M) command to increase the user’s disk limit.<br />

54 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

The symbolic name for this error is EDQUOT, errno=49.<br />

driver is already installed<br />

Cause<br />

Added the Sunpc 4.1 package and then necessary patches (102924). When trying to<br />

run sunpc_install, got the error message. prtconf(1M) shows the driver is not<br />

attached, and modinfo(1M) displays 4 modules.<br />

Tried to remove the package and back out the patch. Then reinstalled, but still<br />

received the same error.<br />

Action<br />

Sunpc had previously been installed on the system. When removing the package<br />

with the pkgrm(1M) command, not all components were removed, because<br />

pkgrm(1M) is not aware of changes made by the sunpc_install script.<br />

To resolve this problem it is necessary to remove sections pertaining to Sunpc in the<br />

files: /etc/devlink.tab, /etc/driver_aliases, and<br />

/etc/rc2.d/S10storekernname, and then reinstall the package.<br />

DUMP: Cannot open dump device<br />

‘/dev/rdsk/c2t0d0s1’: Permission denied<br />

Cause<br />

When using ufsdump(1M) as user sys (UID 3) on a disk drive in a SSA, the<br />

ufsdump(1M) command fails with the message.<br />

Action<br />

The permissions on the ssd ’instance path’ for disk in an SSA are created with 600<br />

permissions. They should be 0640 for a non-root user to be able to read them. For<br />

example,<br />

Alphabetical Message Listing 55


# ls -lL /dev/rdsk/c2t0d0s1<br />

crw------- 1 root sys 192,241 Jul 10 1996 /dev/rdsk/c2t0d0s1<br />

Change it so it reads:<br />

crw-r----- 1 root sys 192,241 Jul 10 1996 /dev/rdsk/c2t0d0s1<br />

You might also want to add the following line<br />

ssd:* 0640 root sys<br />

to the /etc/minor_perm file, so subsequently added arrays do not have the<br />

same problem.<br />

dumptm: Cannot open ‘/dev/rmt/<strong>string</strong>’:<br />

Device busy<br />

Cause<br />

During filesystem backup, the dump program cannot open the tape drive because<br />

some other process is holding it open.<br />

Action<br />

Find the process that has the tape drive open, and either kill(1) the process or wait<br />

for it to finish.<br />

# ps -ef | grep /dev/rmt<br />

# kill -9 processID<br />

56 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


DUP/BAD I=i OWNER=o MODE=m SIZE=s<br />

MTIME=t FILE=f REMOVE?<br />

Cause<br />

During phase 1, fsck(1M) found duplicate blocks or bad blocks associated with the<br />

file or directory specified after FILE=whose inode number appears after I= (with<br />

other information).<br />

Action<br />

To remove this file or directory, answer yes. If you end up removing more than a few<br />

files in this manner, data loss will result, so it might be preferable to restore the<br />

filesystem from backup tapes.<br />

See Also<br />

For more information on checking filesystems, see the section on checking filesystem<br />

integrity in the System Administration Guide, Volume I.<br />

int DUP I=int<br />

Cause<br />

Upon detecting a block that is already claimed by another inode, fsck(1M) prints<br />

the duplicate block number and its containing inode (after I=).<br />

Action<br />

In fsck(1M) phases 2 and 4, you will decide whether or not to clear these bad<br />

blocks. Before committing to repair with fsck(1M), you could determine which file<br />

contains this inode by passing the inode number to the ncheck(1M) command:<br />

# ncheck -iinum filesystem<br />

Alphabetical Message Listing 57


See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

"E"<br />

ENOMEM The available data space is not<br />

large enough to accommodate the shared<br />

memory segment<br />

Cause<br />

ENOMEM errors occur after 80 segments have been allocated by Lotus Notes.<br />

Action<br />

The design and implementation of Solaris ISM (Intimate Shared Memory) is what<br />

caused the ENOMEM failures, from the Lotus Notes application, besause of the limit<br />

reached on the number of shared memory segments that can be attached to a<br />

particular process.<br />

The limit occurs because all shared memory segments are attached in the Intimate<br />

Shared Memory (ISM) mode courtesy of a system variable they have set in the<br />

system file called shmsys:share_page_table.<br />

When a shared memory segment is attached in ISM mode, the OS locks that segment<br />

into physical memory and arranges the virtual/physical address mappings such that<br />

only one copy of the mapping information is shared amongst all attaching processes.<br />

To accomplish this, the OS requires the virtual starting address of the segment be<br />

aligned on a 16 Meg (hex 0x1000000) = 16777216-bytes address boundary.<br />

The NULL address lets the system decide what virtual address the segment should<br />

be attached at. The system also assigns addresses 0x3000000 apart unless forced to<br />

attach addresses at 0x1000000 apart.<br />

Doing a few calculations, a sun4d could create and attach up to 220 1-Meg ISM<br />

segments and a sun4m could create and attach up to 235 1-Meg ISM segments,<br />

providing the segments were 0x1000000 apart.<br />

Given that ISM is what causes the limit, what can we do about it?<br />

58 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


First, the limit only gives Lotus Notes the ability to attach a total of 80 Meg of shared<br />

memory total. By increasing the segment size to 10 Meg, as Lotus has already<br />

recommended, 8 ISM segments will take care of the load previously needing 80<br />

1-Meg segments. The load could conceivably grow to 800-meg now without running<br />

into the ISM addressing limit.<br />

Secondly, we could have turned off the share_page_table (ISM) flag. This would<br />

give a sun4m the ability to create in excess of 3000 1-Meg segments. The problem<br />

here is that ISM does improve the performance of shared memory accesses and if<br />

your customer intends to move up to 2.5.x, ISM is required to get around another set<br />

of problems that were discovered with shared memory loads of this kind.<br />

Thirdly, Lotus could change the Notes server so that it kept track of the attach<br />

addresses and always attached at 0x1000000 boundary addresses instead of letting<br />

the system default to the 0x3000000 address boundary. As I’ve already shown, this<br />

would allow a Notes server to grow to 235 segments on a sun4m.<br />

error 13<br />

Cause<br />

Diskless client boot gave NFS mount error 13 Diskless client boot failed with error:<br />

nfs mount failed: permission denied error 13<br />

Cause<br />

Diskless client fails to boot, NFS error 13 Diskless client fails to boot giving the error<br />

message: nfs error 13<br />

error 15 initializing<br />

Cause<br />

Bad /boot or 4.1 on ss2 - level 15 interrupt<br />

Alphabetical Message Listing 59


Error 76<br />

Cause<br />

This error is RFS specific. This is a way for the server to tell the client that a process<br />

has transferred back from mount point.<br />

Technical Notes<br />

The symbolic name for this error is EDOTDOT, errno=76.<br />

Error 88<br />

Cause<br />

Illegal byte sequence. Handle multiple characters as a single character.<br />

Technical Notes<br />

The symbolic name for this error is EILSEQ, errno=88.<br />

error: DPS has not initialized or<br />

server connection failed<br />

Cause<br />

This message appears when trying to run AnswerBook with a generic X11 window<br />

server or on a generic X terminal.<br />

Action<br />

Running AnswerBook requires Display PostScript (DPS), or a NeWS server, or the<br />

Adobe DPS NS remote display software. In addition, a complete LaserWriterII Type-1<br />

font set (including Palatino) should be installed on the X server. To find out if your X<br />

server has DPS, run xdpyinfo(1) to verify the presence of an "Adobe-DPS-Extension"<br />

line. X servers without this line don’t know about DPS.<br />

60 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Error: Error adding OS service<br />

Solaris 2.6 sparc sun4u:<br />

Cause<br />

While trying to add OS services to a newly installed Solaris 2.6 system using Solstice<br />

Adminsuite 2.3, the process fails with the following error message:<br />

Error: Error adding OS service Solaris 2.6 sparc sun4u:<br />

inconsistent revision, installed package SUNWpppk revision 3.0.1<br />

does not match revision 11.6.0,REV=1997.07.15.21.46 for sparc<br />

architecture.<br />

This is caused by the optional Solstice PPP 3.0.1 packages from the "Solaris Server<br />

Intranet Extension" CD-ROM installed on the system.<br />

Action<br />

The workaround is to remove the PPP 3.0.1 packages and replace them with the PPP<br />

packages off of the Solaris 2.6 release CD-ROM. For example:<br />

# pkgrm SUNWlicsw SUNWlit SUNWpppk SUNWpppm SUNWpppr SUNWppps SUNWpppu<br />

:<br />

: {package remove info}<br />

:<br />

# cd /cdrom/cdrom0/s0/Solaris_2.6/Product<br />

# pkgadd -d . SUNWapppr SUNWapppu SUNWpppk<br />

:<br />

: {package add info}<br />

:<br />

then use adminsuite to add the OS services, which should then work without error.<br />

NOTE: If the Solstice PPP 3.0.1 package is configured and currently in use on the<br />

system, the user should save any of the previously entered PPP configuration info so<br />

that it may be restored after the OS services have been installed. ( pkgrm(1M)the 3<br />

ppp packages installed from the 2.6 release CD, and pkgadd(1M) all of the ppp<br />

packages from the Intranet Extension cdrom again, then redo config.). If the Solstice<br />

PPP 3.0.1 package was not used on the system, there is no reason to reinstall it. Use<br />

/usr/bin/pkginfo to check the installed packages.<br />

This is documented in Chapter 9 of the Solaris Server Intranet Extension Installation<br />

and Release Notes Solaris 2.6 manual.<br />

Alphabetical Message Listing 61


Error Host Unknown:<br />

Cause<br />

On Windows95, running PC-NFSpro2.0, use ping(1M) to reach another computer<br />

on the network. ping(1M) returns Host Unknown.<br />

Cause: Name services are not set up correctly.<br />

Action<br />

1: Click the Windows 95 Start button, point to Programs, point to PC-NFSpro, then<br />

click Configuration.<br />

2: Click TCP/IP and make sure all settings are entered correctly.<br />

3: If NIS is enabled click Configure NIS and make sure the NIS domain and server<br />

names are correct.<br />

4: If DNS is enabled, click Configure DNS and make sure the DNS domain and<br />

server names are correct.<br />

5: Click on edit hosts and add the name and IP address of the machine you are<br />

trying to ping(1M), along with that of your authentication server.<br />

If you make any changes, click OK, then click Save and Exit on the Configuration<br />

dialog box. Shut down and restart Windows95.<br />

ERROR: missing file arg (cm3)<br />

Cause<br />

An attempt was made to run some sccs(1) operation that requires a filename, such<br />

as create, edit, delget, orprt.<br />

Action<br />

Supply the appropriate filename after the SCCS operation.<br />

62 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


ERROR [SCCS/s.<strong>string</strong>]: ‘SCCS/p.<strong>string</strong>’<br />

nonexistent (ut4)<br />

Cause<br />

An attempt was made to sccs(1) edit or sccs get a file that is not yet under<br />

SCCS control.<br />

Action<br />

Run sccs(1) create on that file to place it under SCCS control.<br />

ERROR [SCCS/s.<strong>string</strong>]: writable ‘<strong>string</strong>’<br />

exists (ge4)<br />

Cause<br />

An attempt was made to sccs(1) edit a file that is writable, probably because it is<br />

already checked out.<br />

Action<br />

Run sccs(1) info to see who has the file checked out. If it is you, go ahead and<br />

edit it. If it is somebody else, ask that person to check in the file.<br />

Error: you don’t have a license to<br />

run this program<br />

Cause<br />

Customer tries to mount the /export file system with Volume Manager 2.1.1,<br />

getting the message.<br />

Alphabetical Message Listing 63


Action<br />

Run vxserial -p to print the available volume manager licenses in the system.<br />

Also, check the /etc/vfstab file to make sure that the filesystem is not a vxfs<br />

filesystem.<br />

esp0: data transfer overrun<br />

Cause<br />

When a user tries to mount a CD-ROM on a third-party CD drive, mount(1M) fails<br />

with the above error, followed by the sr0: SCSI transport failed message.<br />

The CD drive probably comes from a vendor unknown to the system.<br />

Action<br />

Third-party CD drives generally have an 8192 block size, as opposed to the 512 block<br />

size on supported Sun drives. Check with the vendor to see if any special<br />

configuration is possible to allow the drive to operate on a Sun workstation.<br />

Event not found<br />

Cause<br />

This C shell message indicates that a user tried to repeat a command from the<br />

history list, but that command or number does not exist in the list.<br />

Action<br />

Run the C shell history(1) command to display recent events in the history list. If<br />

a user often tries to run commands that have disappeared from the history list, make<br />

the list longer by setting history(1) to a higher value.<br />

See Also<br />

For more information about the C shell, see csh(1).<br />

64 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


EXCESSIVE BAD BLKS I=int CONTINUE?<br />

Cause<br />

During phase 1, fsck(1M) found more than 10 bad (out-of-range) blocks associated<br />

with the specified inode number.<br />

Action<br />

With this many bad blocks, it might be preferable to restore the filesystem from<br />

backup tapes.<br />

See Also<br />

For more information on bad blocks, see the section on checking filesystem integrity<br />

in the System Administration Guide, Volume I. If you are using the AnswerBook, "bad<br />

blocks" is a good search <strong>string</strong>.<br />

EXCESSIVE DUP BLKS I=int CONTINUE?<br />

Cause<br />

During phase 1, fsck(1M) found more than 10 duplicate (previously claimed)<br />

blocks associated with the specified inode number.<br />

Action<br />

With this many duplicate blocks, it might be preferable to restore the filesystem from<br />

backup tapes.<br />

See Also<br />

For more information on blocks, see the section on checking filesystem integrity in<br />

the System Administration Guide, Volume I. If you are using the AnswerBook, "bad<br />

blocks" is a good search <strong>string</strong>.<br />

Alphabetical Message Listing 65


Exec format error<br />

Cause<br />

This often happens when trying to run software compiled for different systems or<br />

architectures, such as when executing Solaris 2 programs on a SunOS 4.1 system, or<br />

when trying to execute SPARC-specific programs on an x86 machine. On a Solaris 2<br />

system, it can also occur if the Binary Compatibility Package was not installed.<br />

Action<br />

Make sure that the software matches the architecture and system you’re using. The<br />

file(1) command can help you determine the target architecture. If you’re using<br />

SunOS 4.1 software on a Solaris 2 system, make sure that the Binary Compatibility<br />

Package is installed. You can check for it using this command:<br />

$ pkginfo | grep SUNWbcp<br />

Technical Notes<br />

A request was made to execute a file that, although it has the appropriate<br />

permissions, does not start with a valid format.<br />

The symbolic name for this error is ENOEXEC, errno=8.<br />

See Also<br />

See the a.out(4) man page for a description of executable files.<br />

"F"<br />

failed to initialize adapter<br />

Cause<br />

Using an Adaptec AHA-154x Cx SCSI HBA during installation of Solaris 2.x x86, you<br />

may see the message during the mdb device probe that says<br />

66 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


failed to initialize adapter after the probe has correctly identified the<br />

card. There are a variety of reasons for this error, but in all cases the error is because<br />

of misconfiguring the card.<br />

Action<br />

To correct the problem, press Ctrl-a during boot of the computer to enter the 154x<br />

BIOS configuration utility. Choose the<br />

Configure/View Host Adapter Settings option, then press the F6 key to<br />

return the adapter to its factory default settings.<br />

After doing this, reconfigure the adapter per the instructions contained in the x86<br />

Device Configuration Guide or Driver Update Guide if applicable. It is especially<br />

important that the adapter be configured to use DMA 6. Note that it defaults to<br />

DMA 5 and must be changed.<br />

fbconsole: ioctl SRIOCSREDIR: Device<br />

Busy.<br />

Cause<br />

When starting Openwindows from the command line, the following error message is<br />

echoed on the Solaris "Welcome" screen:<br />

fbconsole: ioctl SRIOCSREDIR: Device Busy<br />

Once inside Openwindows, The following message is displayed in the background<br />

windows and when starting cmdtool -C:<br />

SYSTEM WARNING: Object 0x340f8, Device busy, ioctl SRIOCSREDIR<br />

returned -1, attempt to make tty the console failed (Tty package)<br />

Action<br />

Openwindows was probably started in the background (using the "&"). Exit<br />

Openwindows, and run the command in foreground:<br />

/usr/openwin/bin/openwin<br />

If this doesn’t help, then perhaps some daemon or process is "holding" the console.<br />

Type the command: fuser /dev/console.<br />

A list of process id’s is returned. Examine these processes to determine if an<br />

application has hold of the console (using the ps(1) command, will help).<br />

Alphabetical Message Listing 67


fd0: unformatted diskette or no<br />

diskette in the drive<br />

Cause<br />

This message appears on the system console to indicate that the floppy driver fd(4)<br />

could not read the label on a diskette. Usually this is either because a new diskette<br />

has not yet been formatted, or a formatted diskette has become corrupted. This<br />

message often appears along with "read failed" and "bad format" messages after<br />

volcheck(1) is run.<br />

Action<br />

If you are certain that the diskette contains no data, run fdformat -d to format the<br />

diskette in DOS format. (You can also format a diskette in UFS format if you like,<br />

although then it cannot be transported to most other systems.) When the diskette is<br />

formatted, you can write on it, if it was not corrupted beyond repair.<br />

File descriptor in bad state<br />

Cause<br />

Either a file descriptor refers to no open file or a read request was made to a file that<br />

is open only for writing.<br />

Technical Notes<br />

The symbolic name for this error is EBADFD, errno=81.<br />

File exists<br />

Cause<br />

The name of an existing file was mentioned in an inappropriate context. For<br />

example, establishing a link to an existing file, or overwriting an existing file are not<br />

allowed when the csh(1) noclobber option is set.<br />

68 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

Look at the names of files in the directory, then try again with a different name or<br />

after renaming or removing the existing file.<br />

Technical Notes<br />

The symbolic name for this error is EEXIST, errno=17.<br />

File locking deadlock<br />

Cause<br />

This is a programming problem, in some cases unavoidable.<br />

Action<br />

All a user can do is restart the program and hope deadlock does not reoccur.<br />

Technical Notes<br />

In the file locking subsystem, two processes tried to modify some lock at the same<br />

time. In the multithreading subsystem, two threads became deadlocked and could<br />

not continue. When a program using the threads library encounters this error, it<br />

should restart the deadlocked threads.<br />

The symbolic name for this error is EDEADLOCK, errno=56.<br />

filemgr: mknod: Permission denied<br />

Cause<br />

File Manager issues this message and fails to come up whenever the<br />

/tmp/.removable directory is owned by another user and is not 1777 mode. This<br />

can happen, for example, when multiple users share a workstation.<br />

Alphabetical Message Listing 69


Action<br />

Have the original owner change the mode (( chmod(1)) of this file back to 1777, its<br />

default creation mode. Rebooting the workstation also resolves this problem.<br />

Technical Notes<br />

This is a known problem that was fixed in Solaris 2.4.<br />

File name too long<br />

Cause<br />

The specified file name has too many characters.<br />

Action<br />

If a file name or path name component is too long, devise a shorter name. If the total<br />

path name is longer than PATH_MAX characters, first change to an intermediate<br />

directory, then specify a shorter path name. Newly-created data will be lost unless<br />

written to another file with a shorter name.<br />

Technical Notes<br />

In a UFS or NFS-mounted UFS filesystem, the length of a path name component<br />

exceeds MAXNAMLEN (255) characters, or the total length of the path name exceeds<br />

PATH_MAX (1024) characters. In a System V filesystem, the length of a path name<br />

component exceeds NAME_MAX (14) characters while no-truncation mode is in effect.<br />

These values are defined in the /usr/include/limits.h file.<br />

The symbolic name for this error is ENAMETOOLONG, errno=78.<br />

file system full<br />

Cause<br />

During login file system full errors are seen and login fails with the message<br />

No utmpx entry.<br />

70 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


See Also<br />

See No utmpx entry, below.<br />

FILE SYSTEM STATE IN SUPERBLOCK IS<br />

WRONG; FIX?<br />

Cause<br />

The fsck(1M) command has just checked a filesystem, and has determined that the<br />

filesystem is clean. The filesystem’s superblock, however, still thinks the filesystem is<br />

"dirty" in some way.<br />

Action<br />

If you believe that the filesystem is adequately repaired, answer yes to mark the<br />

filesystem as clean.<br />

Technical Notes<br />

Different "dirty" filesystem types are listed in /usr/include/sys/fs/ufs_fs.h,<br />

and include FSACTIVE, FSBAD, FSFIX, FSLOG, and FSSUSPEND.<br />

See Also<br />

For more information on superblocks, see the section on checking filesystem integrity<br />

in the System Administration Guide, Volume I. If you are using the AnswerBook, "bad<br />

superblock" is a good search <strong>string</strong>.<br />

File table overflow<br />

Cause<br />

The kernel file table is full because too many files are open on the system.<br />

Temporarily, no more files can be opened. New data created under this condition<br />

will probably be lost.<br />

Alphabetical Message Listing 71


Action<br />

Simply waiting often gives the system time to close files. However, if this message<br />

occurs often, reconfigure the kernel to allow more open files. To increase the size of<br />

the file table in Solaris 2, increase the value of maxusers in the /etc/system file.<br />

The default maxusers value is the amount of main memory in MB, minus 2.<br />

Technical Notes<br />

The symbolic name for this error is ENFILE, errno=23.<br />

File too large<br />

Cause<br />

The file size exceeded the limit specified by ulimit(1), or the file size exceeds the<br />

maximum supported by the file system. New data created under this condition will<br />

probably be lost.<br />

Action<br />

In the C shell, use the limit(1) command to see or set the default file size. In the<br />

Bourne or Korn shells, use the ulimit -a command. Even when the shells claim<br />

that the file size is unlimited, in fact the system limit is FCHR_MAX (usually 1<br />

gigabyte).<br />

Technical Notes<br />

The symbolic name for this error is EFBIG, errno=27.<br />

FREE BLK COUNT(S) WRONG IN SUPERBLK<br />

SALVAGE?<br />

Cause<br />

During phase 5, fsck(1M) detected that the actual number of free blocks in the<br />

filesystem did not match the superblock’s free block count. The df(1M) command<br />

accesses this free block count when measuring filesystem capacity.<br />

72 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

Generally you can answer yes to this question without harming the filesystem.<br />

See Also<br />

For more information on superblocks, see the section on checking filesystem integrity<br />

in the System Administration Guide, Volume I. If you are using the AnswerBook, "bad<br />

superblock" is a good search <strong>string</strong>.<br />

fsck: Can’t open /dev/dsk/<strong>string</strong><br />

Cause<br />

The fsck(1M) command cannot open the disk device, because although a similar<br />

filesystem exists, the partition specified does not.<br />

Action<br />

Run the mount(1M) or the format(1M) command to see what filesystems are<br />

configured on the machine. Then run fsck(1M) again on an existing partition.<br />

fsck: Can’t stat /dev/dsk/<strong>string</strong><br />

Cause<br />

The fsck(1M) command cannot open the disk device, because the specified<br />

filesystem does not exist.<br />

Action<br />

Run the mount(1M) or the format(1M) command to see what filesystems are<br />

configured on the machine. Then run fsck(1M) again on an existing filesystem.<br />

Alphabetical Message Listing 73


ftp: ftp/tcp: unknown service<br />

Cause<br />

The user was getting the error. User was using no naming service and the services<br />

file looked fine. User could FTP as root but not as a normal user.<br />

Action<br />

Permissions on the /etc/services file were wrong. User changed them to read<br />

access for everyone (644) to correct the problem.<br />

fw_ipinput: q fc5fddc0:illegal<br />

interface<br />

Cause<br />

The FW-1 kernel module displays the error message when a new network interface<br />

has been added to the FW-1 system while fwd is running.<br />

Action<br />

To resolve this problem, run the following to reinstall the FW kernel and reinstall the<br />

security policy:<br />

# fw ctl uninstall<br />

# fw ctl install<br />

# fw fetch localhost<br />

74 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


fwm: no license<br />

Cause<br />

Firewall-1, version 2.1 and produces the message when the fwstart command is<br />

issued or fwm is started from the command line.<br />

There are two possible reasons for this.<br />

The first is when a firewall module is installed without a control station on the same<br />

machine, the messages are displayed on the console (under Unix) or in the event log<br />

(under WinNT)<br />

The second is that the messages may be legitimate. You may find that fwm has not<br />

started and you cannot do some crucial tasks. One problem may be that the license is<br />

issued for the wrong hostid.<br />

Action<br />

Check that the license daemon is running on the server. Then...<br />

Case one: Workaround: Ignore the messages. Solution: Upgrade to 2.1c or above.<br />

Case two: To check for a misassigned license, run the command hostid(1). Your<br />

hostid is displayed.<br />

Now run the command fw printlic. You will see output like this...<br />

This is FireWall-1 Version 2.1<br />

Type<br />

Expiration Features<br />

id-649f152b never stdlight<br />

The first field should list the correct hostid. Also check the expiration date and the<br />

features. A list of what is included with the features is provided in INFODOC 13215.<br />

If you find any inconsistencies, call the Sun License and Password Center and get a<br />

license reissued . Have you hostid and serial number ready.<br />

Alphabetical Message Listing 75


"G"<br />

giving up<br />

Cause<br />

This message appears in the SCSI log to indicate that a read or write operation has<br />

been retried until it timed out. With SCSI disk the timeout period is usually 30<br />

seconds; with tape the period is usually 20 attempts. Timeout periods are generally<br />

coded into the drivers.<br />

Action<br />

Check that all SCSI devices are connected and powered on. Make sure that SCSI<br />

target numbers are correct and not in conflict. Verify that all cables are no longer<br />

than six meters, total, and that all SCSI connections are properly terminated.<br />

Technical Notes<br />

The scsi_log(9F) routine usually displays messages on the system console and in<br />

the /var/adm/messages file. Run the dmesg(1M) command to see the most recent<br />

message buffer.<br />

Graphics Adapter device /dev/fb is of<br />

unknown type<br />

Cause<br />

The /dev/fb driver is either missing or corrupted.<br />

Action<br />

See "InitOutput: Error loading module for /dev/fb" for details.<br />

76 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


group.org_dir: NIS+ servers<br />

unreachable<br />

Cause<br />

This is the second of three messages that an NIS+ client prints when it cannot locate<br />

an NIS+ server on the network.<br />

Action<br />

See the message "hosts.org_dir: NIS+ servers unreachable" for details.<br />

"H"<br />

hang console<br />

Cause<br />

Console hangs - but all other operations are working, including rlogin(1) and<br />

telnet(1). Rebooting the system (via a remote shell) clears the problem.<br />

This problem occurs if another window is opened with the -C option, causing the<br />

console to hang. The other window could be another cmdtool window, shelltool<br />

window, or even an xterm window. Only one console window can be active at a<br />

time.<br />

Action<br />

The window/process that is causing the problem can be located by using the ps(1)<br />

command (auxw options may be necessary). The process can then be killed. Eliminate<br />

the console window running with -C, and control returns to the ’real’ console.<br />

Machine hung in reboot process: when the user is booting the machine, it hangs at<br />

checking filesystems.<br />

A possible workaround is: 1. Boot miniroot from tape or CD-ROM, 2. mkdir mnt, 3.<br />

Mount the root partition to some mountpoint, i.e. /mnt, 4. Change directory to<br />

/mnt/dev, 5. Check to see if the console is located in the, 6. mnt/dev directory, 7. If<br />

not, make the device std.: MAKEDEV std, 8. Halt the system and reboot.<br />

Alphabetical Message Listing 77


home/<strong>string</strong>: No such file or directory<br />

Cause<br />

An attempt was made to change to a user’s home directory, but either that user does<br />

not exist or the user’s fileserver has not shared (exported) that filesystem.<br />

Action<br />

To check on the existence of a particular user, run the ypmatch(1) or nismatch(1)<br />

command, specifying the user name and then the passwd(4) map.<br />

To export filesystems from the remote fileserver, become superuser on that system<br />

and run the share(1M) command with the appropriate options. If that system is<br />

sharing (exporting) filesystems for the first time, also invoke<br />

/etc/init.d/nfs.server start to begin NFS service.<br />

See Also<br />

For more information on sharing filesystems, see the share_nfs(1M) man page.<br />

Host is down<br />

Cause<br />

A transport connection failed because the destination host was down. For example,<br />

mail delivery was attempted over several days, but the destination machine was not<br />

available during any of these attempts.<br />

Action<br />

Report this error to the system administrator for the host. If you are the person<br />

responsible for this system, check to see if the machine needs repair or rebooting.<br />

Technical Notes<br />

This error results from status information delivered by the underlying<br />

communication interface. If there is no known connection to the host, a different<br />

message usually results. See "No route to host" for details.<br />

78 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


The symbolic name for this error is EHOSTDOWN, errno=147.<br />

host name configuration error<br />

Cause<br />

This is an old sendmail(1M) message, which replaced "I refuse to talk to myself"<br />

and is now replaced by the "Local configuration error" message.<br />

Action<br />

See the message "554 <strong>string</strong>... Local configuration error" for details.<br />

hosts.org_dir: NIS+ servers unreachable<br />

Cause<br />

This is the third of three messages that an NIS+ client prints when it cannot locate an<br />

NIS+ server on the network.<br />

Action<br />

If other NIS+ clients are behaving normally, check the Ethernet cabling on the<br />

workstation showing this message. On SPARC machines, disconnected network<br />

cabling also produces a series of "no carrier" messages. On x86 machines, the NIS+<br />

messages might be your only indication that network cabling is disconnected.<br />

If many NIS+ clients on the network are giving this message, go to the NIS+ server<br />

in question and reboot or repair it, as necessary. When the server machine is back in<br />

operation, NIS+ clients will give an "NIS server for domain OK" message.<br />

Alphabetical Message Listing 79


"I"<br />

I can’t read your attachments. What<br />

mailer are you using?<br />

Cause<br />

The SunView mailtool(1) and pre-3.3 OpenWindows mailtool(1) produce this message<br />

when they cannot cope with an attachment. The attachment is probably in MIME<br />

(Multipurpose Internet Mail Extensions) format, using base64 encoding.<br />

Action<br />

To read a mail message containing MIME attachments, use mailtool(1) from Solaris 2.3<br />

or later. If you are running an earlier version of Solaris, rlogin(1) to a later version<br />

of Solaris, set the DISPLAY environment variable back to the first system, and run<br />

mailtool remotely. If those options prove impossible, ask the originator to send the<br />

message again using mailtool(1), or using the CDE dtmail compose<br />

File->SendAs->SunMailTool option.<br />

Technical Notes<br />

Standard MIME attachments with base64 encoding, for example, produce this<br />

message and fail to display in older mailtool(1)s.<br />

See Also<br />

Look into using metamail, available on the Internet, which allows you to send and<br />

receive MIME attachments.<br />

Identifier removed<br />

Cause<br />

This message indicates an error in a System V IPC facility. Most likely a file<br />

associated with messaging, semaphores, or shared memory was deleted from the<br />

filesystem where it had been created.<br />

80 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

This error is returned to processes that resume execution after the removal of an<br />

identifier from the file system’s name space. See msgctl(2), semctl(2), and<br />

shmctl(2) for details.<br />

The symbolic name for this error is EIDRM, errno=36.<br />

ie0: Ethernet jammed<br />

Cause<br />

This message can appear on SPARCservers or x86 machines with an Intel 82586<br />

Ethernet chip. It indicates that 16 successive transmission attempts failed, causing the<br />

driver to give up on the current packet.<br />

Action<br />

If this error occurs sporadically or at busy times, it probably means that the network<br />

is saturated. Wait for network traffic to clear. If bottlenecks arise frequently, think<br />

about reconfiguring the network or adding subnets.<br />

Another possible cause of this message is a noise source somewhere in the network,<br />

such as a loose transceiver connection. Use snoop(1M) or a similar program to<br />

isolate the problem area, then check and tighten network connectors as necessary.<br />

ie0: no carrier<br />

Cause<br />

This message can appear on SPARCservers or x86 machines with an Intel 82586<br />

Ethernet chip. It indicates that the chip has lost input to its carrier detect pin while<br />

trying to transmit a packet, causing the packet to be dropped.<br />

Action<br />

Check that the Ethernet connector is not loose or disconnected. Other possible causes<br />

include an open circuit somewhere in the network and noise on the carrier detect<br />

line from the transceiver. Use snoop(1M) or a similar program to isolate the<br />

problem area, then check the network connectors and transceivers, as needed.<br />

Alphabetical Message Listing 81


ifconfig: bad address<br />

Cause<br />

System fails to boot with error message: ifconfig: bad address. Upon coming<br />

up to multiuser ifconfig -a indicate:<br />

le0: flags=863 mtu 1500<br />

inet 0.0.0.0 netmask 0<br />

Once up, command<br />

# ifconfig le0 inet hostname<br />

succeeds, and all is well.<br />

Action<br />

Check /etc/hostname.* for a possible bad entry.<br />

/etc/hosts was linked to /var/named/hosts and /var was a separate file<br />

system. Until system comes up in multiuser to mount /var, hostname could not be<br />

resolved to proper IP address.<br />

ifconfig bad address le0<br />

Cause<br />

Customer installed the recommended 2.5.1 patches. When he booted, rootuser.sh<br />

presented the following errors:<br />

ifconfig bad address le0<br />

le0 arp - revarp failed no rarp replies<br />

bad address hme0<br />

hme0 auto-revarp failed: no rarp replies received.<br />

Ip address of interface is set to 0.0.0.0<br />

System fails to: resolve host ip address from /etc/host and no other rarp servers<br />

responded to the systems request for it’s IP address.<br />

82 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

The workaround is to put files before dns [NOT FOUND=return] in<br />

/etc/nsswitch.conf<br />

This matches bug number 4040423. Its summary is: if dns [NOTFOUND=return]<br />

appears before files in /etc/nsswitch.conf, at boot-time ifconfig complains<br />

about bad address. In some cases this can cause the boot to fail.<br />

If pipe/FIFO, don’t sleep in stream<br />

head<br />

Cause<br />

Streams pipe error (not externally visible).<br />

Technical Notes<br />

The symbolic name for this error is ESTRPIPE, errno=92.<br />

Illegal Instruction<br />

Cause<br />

A process has received a signal indicating that it attempted to execute an instruction<br />

that is not allowed by the kernel. This usually results from running programs<br />

compiled for a slightly different machine architecture. This message is usually<br />

accompanied by a core dump, except on read-only filesystems.<br />

Action<br />

If you are booting from CD-ROM or from the net, check README files to make sure<br />

you are using an image appropriate for your machine architecture. Run df(1M) to<br />

make sure there is enough swap space on the system; too little swap space can cause<br />

this error. If you recently upgraded your CPU to a new architecture, replace your<br />

operating system with one that supports the new architecture (an operating system<br />

upgrade might be required).<br />

Alphabetical Message Listing 83


Technical Notes<br />

Sometimes this condition results from a programming error, such as when a program<br />

attempts to execute data as instructions. This condition can also indicate device file<br />

corruption on your system.<br />

Illegal instruction "0xhex" was<br />

encountered at PC 0xhex<br />

Cause<br />

The machine is trying to boot from a non-boot device, or from a boot device for a<br />

different hardware architecture.<br />

Action<br />

If you are booting from the net, check README files to make sure you are using a<br />

boot image for that architecture. If you are booting from disk, make sure the system<br />

is looking at the right disk, which is usually SCSI target 3. Failing these solutions,<br />

connect a CD drive to the system and boot from CD-ROM.<br />

Illegal seek<br />

Cause<br />

Using a pipe ("|") on the command line doesn’t work here.<br />

Action<br />

Rather than using a pipe on the command line, redirect the output of the first<br />

program into a file and run the second program on that file.<br />

Technical Notes<br />

A call to lseek(2) was issued to a pipe. This error condition can also be fixed by<br />

altering the program to avoid using lseek(2).<br />

The symbolic name for this error is ESPIPE, errno=29.<br />

84 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Image Tool: Unable to open XIL Library.<br />

Cause<br />

This message follows multiple multi-line "XilDefaultErrorFunc" errors, indicating that<br />

ImageTool could not locate the X Imaging Library. Many OpenWindows and CDE<br />

deskset programs require XIL.<br />

Action<br />

Run pkginfo(1) to determine what packages are installed on the system. If the<br />

following packages are not present, install them from CD-ROM or over the net:<br />

SUNWxildg, SUNWxiler, SUNWxilow, and SUNWxilrt.<br />

Inappropriate ioctl for device<br />

Cause<br />

This is a programming error.<br />

Action<br />

Ask the program’s author to fix this condition. The program needs to be changed so<br />

it employs a device driver that can accept special character device controls.<br />

Technical Notes<br />

The ioctl(2) system call was given as an argument for a file that is not a special<br />

character device. This message replaces the traditional but puzzling "Not a<br />

typewriter" message.<br />

The symbolic name for this error is ENOTTY, errno=25.<br />

Alphabetical Message Listing 85


INCORRECT BLOCK COUNT I=int (should be<br />

int) CORRECT?<br />

Cause<br />

During phase 1, fsck(1M) determined that the specified inode pointed to a number<br />

of bad or duplicate blocks, so the block count should be corrected to the actual<br />

number shown.<br />

Action<br />

Generally you can answer yes to this question without harming the filesystem.<br />

See Also<br />

For more information on bad blocks, see the section on checking filesystem integrity<br />

in the System Administration Guide, Volume I.<br />

index failed:full:index preceded by<br />

saveset name<br />

Cause<br />

This is a server which has several clients. It seems when the backup kicks off many<br />

the savesets are failing with the message listed below.<br />

godzilla<br />

* godzilla:index 2 retries attempted<br />

* godzilla:index sh: save: not found<br />

index failed:full:index<br />

Action<br />

Edit the /etc/init.d/networker file and change the nsrexecd startup line to<br />

include a -p option to specify the command search path :<br />

(/usr/sbin/nsr/nsrexecd -s masters -p /usr/sbin/nsr )<br />

> /dev/console<br />

86 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


inetd[int]: execv /usr/sbin/in.uucpd:<br />

No such file or directory<br />

Cause<br />

This message indicates that the Internet services daemon inetd(1M) tried to start<br />

up the UUCP service without the UUCP daemon existing on the system.<br />

Action<br />

The SUNWbnuu package must be installed before the machine can run UUCP. Run<br />

pkgadd(1M) to install this package from the distribution CD-ROM or over the<br />

network.<br />

inetd[int]: <strong>string</strong>/tcp: unknown service<br />

Cause<br />

This message indicates that the Internet services daemon inetd(1M) could not<br />

locate the TCP service specified after the first colon.<br />

Action<br />

Check the current machine’s /etc/services file, and the NIS services map, to<br />

see if the service is described. To start this service, add an appropriate entry into the<br />

/etc/services file and possibly the services map as well. Note that NIS+ does<br />

not consult the local /etc/services file unless you put "files" right after "nisplus"<br />

on the services line of the system’s /etc/nsswitch.conf file.<br />

If you do not want to start this service, edit the system’s /etc/inetd.conf file and<br />

delete the entry that tries to start it up.<br />

See Also<br />

For more information about NIS+, see the NIS+ and FNS Administration Guide.<br />

Alphabetical Message Listing 87


inetd[int]: <strong>string</strong>/udp: unknown service<br />

Cause<br />

This message indicates that the Internet services daemon inetd(1M) could not<br />

locate the UDP service specified after the first colon.<br />

Action<br />

See the message "inetd[int]: <strong>string</strong>/tcp: unknown service" for a solution.<br />

inetd: Too many open files<br />

Cause<br />

This message can appear when someone runs a command from the shell or uses a<br />

third-party application. The sar(1) command does not indicate that the system-wide<br />

open file limit has been exceeded.<br />

Action<br />

The probable cause of this message is that the shell limit has been exceeded. The<br />

default open file limit is 64, but can be raised to 256.<br />

See the message "Too many open files" for a solution.<br />

INIT: Cannot create /var/adm/utmp or<br />

/var/adm/utmpx<br />

Cause<br />

This console message indicates that init(1M) cannot write in the /var directory,<br />

which is usually part of the / (root) filesystem. Some other messages follow, and the<br />

system usually comes up single-user. The problem is often that / or /var is<br />

mounted read-only. Sometimes a brief power outage leaves the system believing that<br />

many filesystems are still mounted.<br />

88 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

If /var is a separate filesystem on the machine, and is not yet mounted, mount it<br />

now. If the filesystem containing /var is mounted read-only, remount it read-write<br />

with a command similar to this:<br />

# mount -o rw,remount /<br />

Then type Control-d and try to bring up the system multi-user. If that fails, the root<br />

filesystem is probably corrupted. Run fsck(1M) on the root filesystem, halt the<br />

machine, power cycle the CPU, and wait for the system to reboot. Should this<br />

problem still occur, restore the root filesystem from backup tapes, or re-install the<br />

system from net or CD-ROM to replace the root filesystem.<br />

InitOutput: Error loading module for<br />

/dev/fb<br />

Cause<br />

This fatal X server error message indicates that /dev/fb, the "dumb frame buffer," is<br />

either missing or corrupted. It is usually followed by a "giving up" message and a<br />

few xinit(1) errors.<br />

Action<br />

If other devices on the system are working correctly, the most likely reason for this<br />

error is that the SUNWdfb package was removed or never installed. Insert the<br />

installation CD-ROM, change to the Solaris_2.x directory, and run the following<br />

command to install the packages SUNWdfbh and SUNWdfb (for your machine<br />

architecture):<br />

pkgadd -d .<br />

If other devices on the system are not working correctly, the system might have a<br />

corrupt /devices directory. Halt the system and boot using the -r (reconfigure)<br />

option. The system will run fsck(1M) if the /devices filesystem is corrupted,<br />

most likely fixing the problem.<br />

Alphabetical Message Listing 89


Interrupted system call<br />

Cause<br />

The user issued an interrupt signal (usually Control-c) while the system was in the<br />

middle of executing a system call. When network service is slow, interrupting cd(1)<br />

to a remote-mounted directory can produce this message.<br />

Action<br />

Proceed with your work, this message is purely informational.<br />

Technical Notes<br />

An asynchronous signal (such as interrupt or quit), which a program was set up to<br />

catch, occurred during an internal system call. If execution is resumed after<br />

processing the signal, it will appear as if the interrupted programming function<br />

returned this error condition, so the program might exit with an incorrect error<br />

message.<br />

The symbolic name for this error is EINTR, errno=4.<br />

Invalid argument<br />

Cause<br />

An invalid parameter was specified that the system cannot interpret. For example,<br />

trying to mount an uncreated filesystem, printing without sufficient system support,<br />

or providing an undefined signal to a signal(3C) library function, can all produce<br />

this message.<br />

Action<br />

If you see this message when you are trying to mount a filesystem, make sure that<br />

you have run newfs(1M) to create the filesystem. If you see this message when you<br />

are trying to read a diskette, make sure that the diskette was properly formatted with<br />

fdformat(1), either in DOS format ( pcfs(7FS)) or as a UFS filesystem. If you see<br />

this message while you are trying to print, make sure that the print service is<br />

configured correctly.<br />

90 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

The symbolic name for this error is EINVAL, errno=22.<br />

Invalid null command<br />

Cause<br />

This C shell message results from a command line with two pipes (|) in a row or<br />

from a pipe without a command afterwards.<br />

Action<br />

Change the command line so that each pipe is followed by a command.<br />

Invalid_SS_JWS_HOME:no<br />

C:\\lib\basicframe.properties<br />

Cause<br />

Customer is running WinNT 4.0 and goes to lauch Java Workshop - then gets the<br />

error message.<br />

Action<br />

Removed software loaded on her system from marimba company. The product is<br />

castanet. Removed the preduct from thesystem and JWS works fine. Apparently,<br />

product Tuner comes loaded with JDK, and this conflicts with JWS.<br />

Check out www.marimba.com for more details on marimba products.<br />

Another possible solution:<br />

Double-click jws.exe within the: C:\Java-WorkShop\jws\intel-win32\bin\<br />

folder.<br />

Alphabetical Message Listing 91


I/O error<br />

Cause<br />

Some physical Input/Output error has occurred. If the process was writing a file at<br />

the time, data corruption is possible.<br />

Action<br />

First find out which device is experiencing the I/O error. If the device is a tape drive,<br />

make sure a tape is inserted into the drive. When this error occurs with a tape in the<br />

drive, it is likely that the tape contains an unrecoverable bad spot.<br />

If the device is a floppy drive, an unformatted or defective diskette could be at fault.<br />

Format the diskette, or obtain a replacement.<br />

If the device is a hard disk drive, you might need to run fsck(1M) and possibly<br />

even reformat the disk.<br />

Technical Notes<br />

In some cases this error might occur on a call following the one to which it actually<br />

applies.<br />

The symbolic name for this error is EIO, errno=5.<br />

Is a directory<br />

Cause<br />

An attempt was made to read or write a directory as if it were a file.<br />

Action<br />

Look at a listing of all the files in the current directory and try again, specifying a file<br />

instead of a directory.<br />

Technical Notes<br />

The symbolic name for this error is EISDIR, errno=21.<br />

92 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"J"<br />

java.lang.UnsatisfiedLinkError:<br />

Cause<br />

When trying to start Java Workshop 2.0 (or some other Java applications) on Solaris<br />

2, the following error is displayed:<br />

java.lang.UnsatisfiedLinkError: setCursor<br />

at sun.awt.motif.MComponentPeer.initialize(Compiled Code)<br />

at sun.awt.motif.MTextAreaPeer.initialize(Compiled Code)<br />

at sun.awt.motif.MComponentPeer.(Compiled Code)<br />

at sun.awt.motif.MTextAreaPeer.(Compiled Code)<br />

at sun.awt.motif.MToolkit.createTextArea(Compiled Code)<br />

Action<br />

The LD_LIBRARY_PATH is probably set up to include a Java lib directory that<br />

doesn’t quite match the java bin command used. For example, on Solaris 2.6:<br />

LD_LIBRARY_PATH = /usr/openwin/lib results in Java Workshop runing<br />

properly. But setting: LD_LIBRARY_PATH = /usr/java/lib:/usr/openwin/lib<br />

results in the error being displayed, since Java Workshop uses its own version of JDK<br />

and the startup process picks up a mixture of versions.<br />

The answer is to not include /usr/java/lib in your LD_LIBRARY_PATH since it is<br />

needed only in rare circumstances, such as if you are using the Java Invocation API.<br />

Alphabetical Message Listing 93


"K"<br />

kernel read error<br />

Cause<br />

This message appears when savecore(1M), if activated, tries to copy a debugging<br />

image of kernel memory to disk but cannot read various kernel data structures<br />

correctly. Generally this occurs after a system panic has corrupted main memory.<br />

Data corruption on the system is possible.<br />

Action<br />

Look at the kernel error messages that preceded this one to try to determine the<br />

cause of the problem. Error messages such as BAD TRAP usually indicate faulty<br />

hardware. Until the problem that caused the kernel panic is resolved, a kernel core<br />

image cannot be saved for debugging.<br />

Killed<br />

Cause<br />

This message is purely informational. If the killed process was writing a file, some<br />

data might be lost.<br />

Action<br />

Continue with your work.<br />

Technical Notes<br />

This message from the signal handler or various shells indicates that a process has<br />

been terminated with a SIGKILL. However, if you don’t see this message and cannot<br />

terminate a process with a SIGKILL, you might have to reboot the machine to get<br />

rid of that process.<br />

94 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


kmem_free block already free<br />

Cause<br />

This is a programming error, probably from a device driver.<br />

Action<br />

Determine which driver is giving this message and contact the vendor for a software<br />

update, as this message indicates a bug in the driver.<br />

Technical Notes<br />

This message is from the DDI programming function kmem_free(9F), which<br />

releases a block of memory at address addr of size siz that was previously<br />

allocated by the DDI function kmem_alloc(9F). Both addr and siz must<br />

correspond to the original allocation. If you have source code for the driver, follow<br />

kmem_alloc(9F) and kmem_free(p]9F) in the code to make sure they allocate and free<br />

the same chunk of memory.<br />

"L"<br />

last message repeated int times<br />

Cause<br />

This message comes from syslogd(1M), the facility that prints messages on the<br />

console and records them in /var/adm/messages. To reduce the log size and<br />

minimize buffer usage, syslog collapses any identical messages it sees during a 20<br />

second period, then prints this message with the number of repetitions.<br />

Action<br />

Look above this message to see which message was repeated so often. Then consider<br />

the repeated message and take action accordingly. If repeated log entries such as<br />

su ... failed appear, consider the possibility of a security breach.<br />

Alphabetical Message Listing 95


ld.so.1 fatal: can’t set protection<br />

on segment<br />

Cause<br />

Applications have recently begun to fail with this error:<br />

ld.so.1 fatal: can’t set protection on segment. The failures are<br />

random.<br />

Action<br />

This was happening because of the recent introduction of a rogue application that<br />

consumed most of the swap space on the system. The other applications, which failed<br />

randomly, were doing so because of insufficient swap space to run. The error from<br />

ld.so.1 happened because there was no segment on which to set the protections.<br />

ld.so.1: <strong>string</strong>: fatal: <strong>string</strong>: can’t open<br />

file: errno=2<br />

Cause<br />

This message is produced in Solaris 2.5.1 and earlier releases. It is not produced in<br />

Solaris 2.6, or later releases.<br />

See the next message, which has the same cause.<br />

Action<br />

See the next message, which can be resolved by the same action.<br />

See Also<br />

For more information about the Linker, see the Linker and Libraries Guide.<br />

96 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


ld.so.1: <strong>string</strong>: fatal: <strong>string</strong>: open<br />

failed: No such file or directory<br />

Cause<br />

This message is produced in Solaris 2.6 and later releases. It is not produced in<br />

Solaris 2.5.1, or earlier releases.<br />

This message indicates that the run time linker, ld.so.1(1), while running the<br />

program specified after the first colon, could not find the shared object specified after<br />

the third colon. (A shared object is sometimes called a dynamically linked library.)<br />

Action<br />

As a workaround, set the environment variable LD_LIBRARY_PATH to include the<br />

location of the shared object in question, for example:<br />

/usr/dt/lib:/usr/openwin/lib<br />

Better yet, if you have access to source code, recompile the program using the -Rpath<br />

loader option. Using LD_LIBRARY_PATH slows down performance.<br />

See Also<br />

For more information about the Linker, see the Linker and Libraries Guide.<br />

ld.so.1: <strong>string</strong>: fatal: relocation<br />

error: symbol not found: <strong>string</strong><br />

Cause<br />

This message is produced in Solaris 2.5.1 and earlier releases. It is not produced in<br />

Solaris 2.6, or later releases.<br />

See the next message, which has the same cause.<br />

Action<br />

See the next message, which can be resolved by the same action.<br />

Alphabetical Message Listing 97


Technical Notes<br />

This error does not necessarily occur when you first bring up an application. It could<br />

take months to develop, if ordinary use of the application seldom references the<br />

undefined symbol.<br />

See Also<br />

For more information about the Linker, see the Linker and Libraries Guide.<br />

ld.so.1: <strong>string</strong>: fatal: relocation error:<br />

<strong>string</strong>: <strong>string</strong>: referenced symbol not found<br />

Cause<br />

This message is produced in Solaris 2.6 and later releases. It is not produced in<br />

Solaris 2.5.1, or earlier releases.<br />

The message from the run time linker ld.so.1(1) indicates that in trying to execute<br />

the application given after the first colon, the specified symbol could not be found<br />

for relocation. The message goes on to say in what file the symbol was referenced.<br />

Since this is a fatal error, the application terminates with this message.<br />

Action<br />

Run the ldd -d command on the application to show its shared object dependencies<br />

and symbols that aren’t found. Probably your system contains an old version of the<br />

shared object that should contain this symbol. Contact the library vendor or author<br />

for an update.<br />

Technical Notes<br />

This error does not necessarily occur when you first bring up an application. It could<br />

take months to develop, if ordinary use of the application seldom references the<br />

undefined symbol.<br />

See Also<br />

For more information about the Linker, see the Linker and Libraries Guide.<br />

98 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


le0: Memory error!<br />

Cause<br />

This message indicates that the network interface encountered an access time-out<br />

from the CPU’s main memory. There is probably nothing wrong except system<br />

overload.<br />

Action<br />

If the system is busy with other processes, this error can occur frequently. If possible,<br />

try to reduce the system load by quitting applications or killing some processes.<br />

Technical Notes<br />

The Lance Ethernet chip timed out while trying to acquire the bus for a DVMA<br />

transfer. Most network applications wait for a transfer to occur, so generally no data<br />

gets lost. However, data transfer might fail after too many time-outs.<br />

See Also<br />

For more information about the Lance Ethernet chip, see the le(7D) man page.<br />

le0: No carrier-- cable disconnected<br />

or hub link test disabled?<br />

Cause<br />

Standalone machines with no Ethernet port connection get this error when the system<br />

tries to access the network. If the Ethernet cable is disconnected, SPARC machines<br />

with the sun4m architecture usually display this message, whereas machines with<br />

the sun4c architecture usually display the "le0: No carrier– transceiver cable problem"<br />

message instead. If the Ethernet cable is connected, this message could result from a<br />

mismatch between the machine’s NVRAM settings and the Ethernet hub settings.<br />

Action<br />

If this message is continuous, try to save any work to local disk.<br />

Alphabetical Message Listing 99


When a machine is configured as a networked system, it must be plugged into the<br />

Ethernet with a twisted pair J45 connector.<br />

If the Ethernet cable is plugged in, find out whether or not the Ethernet hub does a<br />

Link Integrity Test. Then become superuser to check and possibly set the machine’s<br />

NVRAM. If the hub’s Link Integrity Test is disabled, set this variable to false.<br />

# eeprom | grep tpe<br />

tpe-link-test?=true<br />

# eeprom ’tpe-link-test?=false’<br />

The default setting is true. If for some reason tpe-link-test? was set to false,<br />

and the hub’s Link Integrity Test is enabled, reset this variable to true.<br />

le0: No carrier-- transceiver cable<br />

problem?<br />

Cause<br />

Standalone machines with no Ethernet port connection get this error when the<br />

system tries to access the network.<br />

Action<br />

If this message is continuous, try to save any work to local disk.<br />

When a machine is configured as a networked system, it must be plugged into the<br />

Ethernet with either a twisted pair J45 connector or thicknet 10Base-T connector<br />

(depending on the building’s Ethernet cable type).<br />

Technical Notes<br />

Older workstations have a thicknet connection on the back instead of a twisted pair<br />

Ethernet connection, so they require a thicknet to twisted pair transceiver to translate<br />

between cabling types.<br />

100 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


level 15 interrupt<br />

Cause<br />

on an SS20<br />

.lib section in a.out corrupted<br />

Cause<br />

Trying to exec(2) an a.out(4) that requires a static shared library (to be linked in)<br />

and there was erroneous data in the .lib section of the a.out(4). The .lib section<br />

tells exec(2) which static shared libraries are needed. The a.out(4) is probably<br />

corrupted.<br />

Technical Notes<br />

The symbolic name for this error is ELIBSCN, errno=85.<br />

LINK COUNT FILE I=i OWNER=o MODE=m<br />

SIZE=s MTIME=t COUNT... ADJUST?<br />

Cause<br />

During phase 4, fsck(1M) determined that the inode’s link count for the specified<br />

file is wrong, and asks if you want to adjust it to the value given.<br />

Action<br />

Generally you can answer yes to this question without harming the filesystem.<br />

See Also<br />

For more information on fsck(1M), see the section on checking filesystem integrity<br />

in the System Administration Guide, Volume I.<br />

Alphabetical Message Listing 101


Link has been severed<br />

Cause<br />

This error occurs when the connection to a remote machine is gone, for example after<br />

a remote procedure call is interrupted.<br />

Technical Notes<br />

The symbolic name for this error is ENOLINK, errno=67.<br />

LL105W: Protocol error detected.<br />

Cause<br />

This error message comes from Lifeline Mail, an unbundled PC compatibility<br />

application.<br />

Action<br />

The likeliest cause for this problem is that someone set up a user account without a<br />

password. Assign the user a password to solve this problem.<br />

ln: cannot create /dev/fb: Read-only<br />

file system<br />

Cause<br />

During device reconfiguration at boot time, the system cannot link to the frame<br />

buffer because /dev is on a read-only filesystem.<br />

Action<br />

Check that /dev/fb is a symbolic link to the hardware frame buffer, such as<br />

cgsix(7D) or tcx(7D). Ensure that the filesystem containing /dev is mounted<br />

read-write.<br />

102 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


lockd[int]: create_client: no name for<br />

inet address 0xhex<br />

Cause<br />

This lock daemon message usually indicates that the NIS hosts.byname and<br />

hosts.byaddr maps are not coordinated.<br />

Action<br />

Wait a short time for the maps to synchronize. If they don’t, take steps to coordinate<br />

them.<br />

See Also<br />

For information on updating NIS data, see the section on NIS maps in the NIS+ and<br />

FNS Administration Guide. If you are using the AnswerBook, "hosts.byaddr" is a<br />

good search <strong>string</strong>.<br />

Login incorrect<br />

Cause<br />

This message from the login(1) program indicates an incorrect combination of<br />

login name and password. There is no way to tell whether what’s wrong is the login<br />

name, the password, or both. Other programs such as ftp(1), rexecd(1M),<br />

sulogin(1M), and uucp(1C) also give this error under similar conditions.<br />

Action<br />

Check the /etc/passwd file and the NIS or NIS+ passwd map on the local system<br />

to see if an entry exists for this user. If a user has simply forgotten the password,<br />

su(1M) and set a new one with the passwd(1) username command. This command<br />

automatically updates the NIS+ passwd map, but with NIS you’ll need to coordinate<br />

the update with the passwd map.<br />

The "Login incorrect" problem can also occur with older versions of NIS when the<br />

user name has more than eight characters. If this is the case, edit the NIS password<br />

file, change the user name to have eight or fewer characters, and then remake the<br />

NIS passwd map.<br />

Alphabetical Message Listing 103


If you cannot log in to the system as root, despite knowing the proper password, it is<br />

possible that the /etc/passwd file is corrupted. Try to log in as a regular user and<br />

su(1M) to root.<br />

If that doesn’t work, see the message "su: No shell" and follow most of the<br />

instructions given there. Instead of changing the default shell, make the password<br />

field blank in /etc/shadow.<br />

lp hang<br />

Cause<br />

On a print server, the queue continues to grow but nothing comes out of the printer.<br />

The printer daemon is hung.<br />

Action<br />

Here is a simple procedure for flushing a hung printing queue: 1. Login or switch<br />

user to root; 2. Issue the reject(1M) printername command to make sure no one<br />

sends any job to the printer; 3. Turn off power to the printer; 4. If the active job<br />

appears to be causing the hang, remove it from the print queue with the cancel(1)<br />

jobnumber command, and ask the owner to requeue that print job; 5. Shut down the<br />

print queue with the /usr/lib/lpshut command; 6. Remove the lock file<br />

/var/spool/lp/SCHEDLOCK and the temporary files /var/spool/lp/tmp/*/*;<br />

7. Turn the printer back on; 8. Restart the print queue with the /usr/lib/lpsched<br />

command.<br />

See Also<br />

For more information on print queuing, see the System Administration Guide, Volume<br />

II. If you are using the AnswerBook, "print server" is a good search <strong>string</strong>.<br />

104 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"M"<br />

Machine is not on the network<br />

Cause<br />

This error is Remote File Sharing (RFS) specific. It occurs when users try to advertise,<br />

unadvertise, mount, or unmount remote resources while the machine has not<br />

properly started a network connect.<br />

Technical Notes<br />

The symbolic name for this error is ENONET, errno=64.<br />

mailtool: Can’t create dead letter:<br />

Permission denied<br />

Cause<br />

An attempt was made to send a message with mailtool(1) from a directory where the<br />

user does not have write permission, and the user’s home directory is currently<br />

unavailable.<br />

Action<br />

Change to another directory and start mailtool(1) again, or use chmod(1) to change<br />

permissions for the directory (if possible).<br />

Alphabetical Message Listing 105


mailtool: Could not initialize the<br />

Classing Engine<br />

Cause<br />

When a user runs mailtool(1) on a remote machine, setting the DISPLAY environment<br />

back to the local machine, this message might appear inside a dialog box window.<br />

The dialog box goes on to say that the Classing Engine must be installed to use<br />

Attachments. This problem occurs because rlogin(1) does not propagate the user’s<br />

environment.<br />

Action<br />

Exit mailtool(1) and set your OPENWINHOME environment variable to /usr/openwin.<br />

Then run mailtool(1) again. The error message will not appear, and you will be able<br />

to use Attachments.<br />

Technical Notes<br />

Classing Engine is a new name for Tool Talk. Earlier versions of mailtool(1) said<br />

Tool Talk: TT_ERR_NOMP instead of Classing Engine.<br />

Mail Tool is confused about the state<br />

of your Mail File.<br />

Cause<br />

This message appears in a pop-up dialog box whenever you ask mailtool(1) to access<br />

messages after another mail reader has modified your inbox. A request follows:<br />

"Please Quit this Mail Tool."<br />

Action<br />

Click "Continue" to close the dialog box, then exit mailtool(1). If you continue trying<br />

to read mail, messages deleted by the other mail reader will never appear, and<br />

mailtool(1) will fail to see any new messages.<br />

106 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


mail: Your mailfile was found to be<br />

corrupted (Content-length mismatch).<br />

Cause<br />

This message comes from mail(1) or mailx(1)(1) whenever it detects messages<br />

with a different content length than advertised. The mail(1) program tells you<br />

which message might be truncated or might have another message concatenated to it.<br />

Two common causes of content length mismatches are the simultaneous use of<br />

different mail readers (such as mail(1) and mailtool(1)), or using a mail reading<br />

program (or an editor) that does not update the Content-Length field after altering a<br />

message.<br />

Action<br />

The mailx(1) program can usually recover from this error and delineate mail<br />

message boundaries correctly. Pay close attention to the message that might be<br />

truncated or combined with another message, and to all messages after that one. If a<br />

mail file becomes hopelessly corrupted, run it through a text editor to eliminate all<br />

Content-Length lines, and ensure that each message has a From (no colon) line for<br />

each message, preceded by a blank line.<br />

To avoid mailfile corruption, exit from mailtool(1) without saving changes when you<br />

are currently running mail(1) or mailx(1).<br />

Machine is not on the network<br />

Cause<br />

This error is Remote File Sharing (RFS) specific. It occurs when users try to advertise,<br />

unadvertise, mount, or unmount remote resources while the machine has not<br />

properly started a network connect.<br />

Technical Notes<br />

The symbolic name for this error is ENONET, errno=64.<br />

Alphabetical Message Listing 107


mailtool: Can’t create dead letter:<br />

Permission denied<br />

Cause<br />

An attempt was made to send a message with mailtool(1) from a directory where the<br />

user does not have write permission, and the user’s home directory is currently<br />

unavailable.<br />

Action<br />

Change to another directory and start mailtool(1) again, or use chmod(1) to change<br />

permissions for the directory (if possible).<br />

mailtool: Could not initialize the<br />

Classing Engine<br />

Cause<br />

When a user runs mailtool(1) on a remote machine, setting the DISPLAY environment<br />

back to the local machine, this message might appear inside a dialog box window.<br />

The dialog box goes on to say that the Classing Engine must be installed to use<br />

Attachments. This problem occurs because rlogin(1) does not propagate the user’s<br />

environment.<br />

Action<br />

Exit mailtool(1) and set your OPENWINHOME environment variable to /usr/openwin.<br />

Then run mailtool(1) again. The error message does not appear, and you can use<br />

Attachments.<br />

Technical Notes<br />

Classing Engine is a new name for Tool Talk. Earlier versions of mailtool(1) said<br />

Tool Talk: TT_ERR_NOMP instead of Classing Engine.<br />

108 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Mail Tool is confused about the state<br />

of your Mail File.<br />

Cause<br />

This message appears in a pop-up dialog box whenever you ask mailtool(1)(1) to<br />

access messages after another mail reader has modified your inbox. A request<br />

follows: "Please Quit this Mail Tool."<br />

Action<br />

Click "Continue" to close the dialog box, then exit mailtool(1). If you continue trying<br />

to read mail, messages deleted by the other mail reader will never appear, and<br />

mailtool(1) will fail to see any new messages.<br />

mail: Your mailfile was found to be<br />

corrupted (Content-length mismatch).<br />

Cause<br />

This message comes from mail(1) or mailx(1) whenever it detects messages with a<br />

different content length than advertised. The mail(1) program tells you which<br />

message might be truncated or might have another message concatenated to it.<br />

Two common causes of content length mismatches are the simultaneous use of<br />

different mail readers (such as mail(1) and mailtool(1)), or using a mail reading<br />

program (or an editor) that does not update the Content-Length field after altering a<br />

message.<br />

Action<br />

The mailx(1) program can usually recover from this error and delineate mail<br />

message boundaries correctly. Pay close attention to the message that might be<br />

truncated or combined with another message, and to all messages after that one. If a<br />

mail file becomes hopelessly corrupted, run it through a text editor to eliminate all<br />

Content-Length lines, and ensure that each message has a From (no colon) line for<br />

each message, preceded by a blank line.<br />

To avoid mailfile corruption, exit from mailtool(1) without saving changes when you<br />

are currently running mail(1) or mailx(1).<br />

Alphabetical Message Listing 109


file name may contain holes - can’t swap<br />

on it.<br />

Cause<br />

A swapfile was created with the command:<br />

# mkfile -nv 50m /ab/swap_50mb<br />

When the user tried to add the file:<br />

# swap -a /ab/swap_50mb<br />

It failed with the message:<br />

/ab/swap_50mb may contain holes - can’t swap on it.<br />

/ab/swap_50mb: Error 0<br />

Action<br />

The -n option was supported on SunOS 4, but on SunOS 5 (Solaris 2) -n works only<br />

when the file is to be used by NFS. Local swap files cannot be created with the -n<br />

option.<br />

mbuf map full<br />

Cause<br />

mbuf allocation<br />

Memory address alignment<br />

Cause<br />

This message can occur when printing large files on a SPARCprinter attached to a<br />

SPARCstation 2.<br />

110 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

Replace the SPARCstation 2 CPU with one that is at the most recent dash level.<br />

memory leaks<br />

Cause<br />

An application uses up more and more memory, until all swap space is exhausted.<br />

Action<br />

Many developers have found that third party software (such as Purify) can help<br />

identify memory leaks in their applications. If you suspect that you have a memory<br />

leak, you can use sar(1) to check on the Kernel Memory Allocation (KMA). Any<br />

driver or module that uses KMA resources, but does not specifically return the<br />

resources before it exits, can create a memory leak.<br />

See Also<br />

For more information on memory leaks, see the section on monitoring system<br />

activity in the System Administration Guide, Volume II. If you are using the<br />

AnswerBook, "displaying disk usage" is a good search <strong>string</strong>. Also, see the section on<br />

system resource problems in the NIS+ and FNS Administration Guide.<br />

Message too long<br />

Cause<br />

A message sent on a transport provider was larger than the internal message buffer<br />

or some other network limit.<br />

Technical Notes<br />

The symbolic name for this error is EMSGSIZE, errno=97.<br />

Alphabetical Message Listing 111


mount: /dev/dsk/<strong>string</strong> is already<br />

mounted, /<strong>string</strong> is busy, or...<br />

Cause<br />

While trying to mount a filesystem, the mount(1M) command received a "Device<br />

busy" (EBUSY) error code. There are several possible reasons: this /dev/dsk<br />

filesystem is already mounted on a different directory, the busy path name is the<br />

working directory of an active process, or the system has exceeded its maximum<br />

number of mount points (unlikely).<br />

Action<br />

Run /etc/mount to see if the filesystem is already mounted. If not, check to see if<br />

any shells are active in the busy directory (did the user cd(1) into the directory?), or<br />

if any processes in the ps(1) listing are active in that directory. If the reason for the<br />

error message isn’t obvious, try using a different directory for the mount point.<br />

mount: giving up on: /<strong>string</strong><br />

Cause<br />

An existing server did not respond to an NFS mount request, so after retrying a<br />

number of times (default 1000), the mount(1M) command has given up. Nonexistent<br />

servers or bad mount points produce different messages.<br />

Action<br />

If the RPC: Program not registered message precedes this one, the requested<br />

mount server probably did not share (export) any filesystems, so it has no NFS<br />

daemons running. Have the superuser on the mount server share(1M) the<br />

filesystem, then run /etc/init.d/nfs.server start to begin NFS service.<br />

If the requested mount server is down or slow to respond, check to see whether the<br />

machine needs repair or rebooting.<br />

112 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


mount: mount-point /<strong>string</strong> does not exist.<br />

Cause<br />

Someone tried to mount a filesystem onto the specified directory, but there is no such<br />

directory.<br />

Action<br />

If this is the directory name you want, run mkdir(1) to create this directory as a<br />

mount point.<br />

mount: the state of /dev/dsk/<strong>string</strong> is<br />

not okay<br />

Cause<br />

The system was unable to mount the filesystem that was specified because the<br />

super-block indicates that the filesystem might be corrupted. This is not an<br />

impediment for read-only mounts.<br />

Action<br />

If you don’t need to write on this filesystem, mount(1M) it using the -o ro option.<br />

Otherwise, do as one of the message continuation lines suggests and run fsck(1M)<br />

to correct the filesystem state and update the super-block.<br />

See Also<br />

For more information on using fsck(1M), see the section on checking filesystem<br />

integrity in the System Administration Guide, Volume I.<br />

Alphabetical Message Listing 113


Multihop attempted<br />

Cause<br />

This error occurs when users try to access remote resources that are not directly<br />

accessible.<br />

Technical Notes<br />

The symbolic name for this error is EMULTIHOP, errno=74.<br />

mbuf map full<br />

Cause<br />

mbuf allocation<br />

Memory address alignment<br />

Cause<br />

This message can occur when printing large files on a SPARCprinter attached to a<br />

SPARCstation 2.<br />

Action<br />

Replace the SPARCstation 2 CPU with one that is at the most recent dash level.<br />

memory leaks<br />

Cause<br />

An application uses up more and more memory, until all swap space is exhausted.<br />

114 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

Many developers have found that third party software (such as Purify) can help<br />

identify memory leaks in their applications. If you suspect that you have a memory<br />

leak, you can use sar(1) to check on the Kernel Memory Allocation (KMA). Any<br />

driver or module that uses KMA resources, but does not specifically return the<br />

resources before it exits, can create a memory leak.<br />

See Also<br />

For more information on memory leaks, see the section on monitoring system<br />

activity in the System Administration Guide, Volume II. If you are using the<br />

AnswerBook, "displaying disk usage" is a good search <strong>string</strong>. Also, see the section on<br />

system resource problems in the NIS+ and FNS Administration Guide.<br />

Message too long<br />

Cause<br />

A message sent on a transport provider was larger than the internal message buffer<br />

or some other network limit.<br />

Technical Notes<br />

The symbolic name for this error is EMSGSIZE, errno=97.<br />

mount: /dev/dsk/<strong>string</strong> is already<br />

mounted, /<strong>string</strong> is busy, or...<br />

Cause<br />

While trying to mount a filesystem, the mount(1M) command received a "Device<br />

busy" (EBUSY) error code. There are several possible reasons: this /dev/dsk<br />

filesystem is already mounted on a different directory, the busy path name is the<br />

working directory of an active process, or the system has exceeded its maximum<br />

number of mount points (unlikely).<br />

Alphabetical Message Listing 115


Action<br />

Run /etc/mount to see if the filesystem is already mounted. If not, check to see if<br />

any shells are active in the busy directory (did the user cd(1) into the directory?), or<br />

if any processes in the ps(1) listing are active in that directory. If the reason for the<br />

error message isn’t obvious, try using a different directory for the mount point.<br />

mount: giving up on: /<strong>string</strong><br />

Cause<br />

An existing server did not respond to an NFS mount request, so after retrying a<br />

number of times (default 1000), the mount(1M) command has given up. Nonexistent<br />

servers or bad mount points produce different messages.<br />

Action<br />

If the "RPC: Program not registered" message precedes this one, the requested mount<br />

server probably did not share (export) any filesystems, so it has no NFS daemons<br />

running. Have the superuser on the mount server share(1M) the filesystem, then<br />

run /etc/init.d/nfs.server start to begin NFS service.<br />

If the requested mount server is down or slow to respond, check to see whether the<br />

machine needs repair or rebooting.<br />

mount: mount-point /<strong>string</strong> does not<br />

exist.<br />

Cause<br />

Someone tried to mount a filesystem onto the specified directory, but there is no such<br />

directory.<br />

Action<br />

If this is the directory name you want, run mkdir(1) to create this directory as a<br />

mount point.<br />

116 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


mount: the state of /dev/dsk/<strong>string</strong> is<br />

not okay<br />

Cause<br />

The system was unable to mount the filesystem that was specified because the<br />

super-block indicates that the filesystem might be corrupted. This is not an<br />

impediment for read-only mounts.<br />

Action<br />

If you don’t need to write on this filesystem, mount(1M) it using the -o ro option.<br />

Otherwise, do as one of the message continuation lines suggests and run fsck(1M)<br />

to correct the filesystem state and update the super-block.<br />

See Also<br />

For more information on using fsck(1M), see the section on checking filesystem<br />

integrity in the System Administration Guide, Volume I.<br />

Multihop attempted<br />

Cause<br />

This error occurs when users try to access remote resources that are not directly<br />

accessible.<br />

Technical Notes<br />

The symbolic name for this error is EMULTIHOP, errno=74.<br />

Alphabetical Message Listing 117


"N"<br />

Name not unique on network<br />

Cause<br />

Given log name not unique.<br />

Technical Notes<br />

The symbolic name for this error is ENOTUNIQ, errno=80.<br />

named [pid]: hostname.domainname has CNAME<br />

and other data (illegal)<br />

Cause<br />

On the DNS server, the error message is displayed.<br />

Action<br />

This error indicates that an alias (CNAME) is associated with another type of DNS<br />

record.<br />

The DNS system allow you to set up an alias to a system using the CNAME record.<br />

An example of this is:<br />

alias1<br />

IN CNAME host1.domain1.<br />

The alias alias1 cannot appear in any other type of record. Only the actual name of<br />

the host may be used. So, if you wanted to use this host as a mail exhanger, the<br />

record:<br />

alias1 IN MX 10 host2.domain1.<br />

would be illegal and would produce the error.<br />

Instead, you should use<br />

118 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


host1 IN MX 10 host2.domain1.<br />

This goes for all types of records, including HINFO and A records.<br />

Also, it is possible to get this error without explicitly setting the left hand side of a<br />

record. The DNS system defaults the left hand side to the last given left hand side.<br />

So you might have the following in a named database file:<br />

host1 IN A 123.124.125.126<br />

IN HINFO Sun Solaris<br />

alias1 IN CNAME host1.domain1.<br />

IN MX 10 host2.domain1.<br />

In this fragment, there is an implied alias1 in the left hand side of the MX record.<br />

If the alias was added after the database was in use for a while, the error would<br />

suddenly start showing up. Since the MX record was legal until the CNAME was added<br />

in front of it. This example could be fixed either by reversing the order of the MX and<br />

CNAME records, or explicitly giving the host1 in the lefthand side of the MX record.<br />

/net/<strong>string</strong>: No such file or directory<br />

Cause<br />

A user tried to change directory (for example with cd(1)) to a network partition on<br />

the system specified after /net/, but this host either does not exist or has not shared<br />

(exported) any filesystem.<br />

Action<br />

To gain access to files on this system, try rlogin(1).<br />

To export filesystems from the remote system, become superuser on that system and<br />

run the share(1M) command with the appropriate options. If that system is sharing<br />

filesystems for the first time, also run /etc/init.d/nfs.server start to begin<br />

NFS service.<br />

Alphabetical Message Listing 119


Network dropped connection because of<br />

reset<br />

Cause<br />

The host you were connected to crashed and rebooted.<br />

Technical Notes<br />

The symbolic name for this error is ENETRESET, errno=129.<br />

Network is down<br />

Cause<br />

A transport connection failed because it encountered a dead network.<br />

Action<br />

Report this error to the system administrator for the network. If you are the person<br />

responsible for this network, check to see why the network is dead and what repairs<br />

are necessary.<br />

Technical Notes<br />

This error results from status information delivered by the underlying<br />

communication interface.<br />

The symbolic name for this error is ENETDOWN, errno=127.<br />

Network is unreachable<br />

Cause<br />

An operational error occurred either because there was no route to the network or<br />

because negative status information was returned by intermediate gateways or<br />

switching nodes.<br />

120 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


The returned status is not always sufficient to distinguish between a network that is<br />

down and a host that is down. See the "No route to host" message.<br />

Action<br />

Check the network routers and switches to see if they are disallowing these packet<br />

transfers. If they are allowing all packet transfers, check network cabling and<br />

connections.<br />

Technical Notes<br />

The symbolic name for this error is ENETUNREACH, errno=128.<br />

NFS getattr failed for server <strong>string</strong>:<br />

RPC: Timed out<br />

Cause<br />

This message appears on an NFS client that requested a service from an NFS server<br />

whose hardware is failing. Often the message "NFS read failed" appears along with<br />

this message. If the server were merely down or slow to respond, the "NFS server<br />

not responding" message would appear instead. Data corruption on the server<br />

system is possible.<br />

Action<br />

Because this message usually indicates server hardware failure, initiate repair<br />

procedures as soon as possible. Check the memory modules, disk controllers, and<br />

CPU board.<br />

See Also<br />

For more information on NFS tuning, see chapter on monitoring network<br />

performance in the System Administration Guide, Volume II.<br />

Alphabetical Message Listing 121


nfs mount: Couldn’t bind to reserved<br />

port<br />

Cause<br />

This message appears when a client attempts to NFS mount a filesystem from a server<br />

that has more than one Ethernet interface configured on the same physical subnet.<br />

Action<br />

Always connect multiple Ethernet interfaces on one router system to different<br />

physical subnetworks.<br />

NFS mounted callog file Unsupported.<br />

Cause<br />

After installing Solaris 2.6 on a system, when users try to bring up their calendars<br />

either with CDE’s calendar manager (/usr/dt/bin/dtcm) or OpenWindow’s<br />

calendar (/usr/openwin/bin/cm), they see the dialog box:<br />

Calendar :Informational - NFS mounted callog file Unsupported.<br />

Your default startup Calendar file appears to be NFS mounted or<br />

a symlink to the same. This is Not Supported.<br />

Continue<br />

The following error is displayed in the console window when the Continue button is<br />

clicked:<br />

date time host rpc.cmsd[pid]: rpc.cmsd :<br />

NFS mounted callog file Not Supported - user@host<br />

date time host rpc.cmsd[pid]: rpc.cmsd :<br />

NFS mounted callog file Not Supported - user@host<br />

The calendars would’ve worked under 2.5.1 or before however.<br />

122 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

It has long been known that NFS mounted calendars are not supported in Solaris. Of<br />

the calendar can be corrupted when more than one person uses the calendar at the<br />

same time. If two rpc.cmsd daemons write to the callog file at the same time, the<br />

file becomes corrupt. However, two rpc.cmsd daemons could be run simultaneously<br />

through Solaris 2.5.1 even though this isn’t a supported configuration.<br />

With Solaris 2.6, this is no longer an option. rpc.cmsd does not allow the user to<br />

bring up a calendar that is NFS mounted and produces the error message above.<br />

nfs mount: mount: <strong>string</strong>: Device busy<br />

Cause<br />

This message appears when the superuser attempts to NFS mount on top of an<br />

active directory. The busy device is actually the working directory of a process.<br />

Action<br />

Determine which shell on the workstation is currently located below the mount<br />

point, and change out of that directory. Be wary of subshells (such as su(1M) shells)<br />

that could be in different working directories while the parents remain below the<br />

mount point.<br />

NFS mount: /<strong>string</strong> mounted OK<br />

Cause<br />

While booting, the system failed to mount the directory specified after the first colon,<br />

probably because the NFS server involved was down or slow to respond. The mount<br />

ran in the background and successfully contacted the NFS server.<br />

Action<br />

This is a purely informative message to let you know that the mount process has<br />

completed.<br />

Alphabetical Message Listing 123


NFS read failed for server <strong>string</strong><br />

Cause<br />

This is generally a permissions problem. Perhaps a directory or file permission was<br />

changed while the client held the file open. Perhaps the filesystem’s share or<br />

netgroup permissions changed. If the server were down or the network saturated,<br />

the "NFS server not responding" message would appear instead.<br />

Action<br />

Log in to the NFS server and check the permissions of directories leading to the file.<br />

Make certain that the filesystem is shared with (exported to) the client experiencing<br />

an NFS read failure.<br />

See Also<br />

For more information, see the chapter on NFS troubleshooting in the NFS<br />

Administration Guide.<br />

nfs_server: bad getargs for int/int<br />

Cause<br />

This message comes from the NFS server when it recieves a request with<br />

unrecognized or incorrect arguments. Typically, it means the request could not be<br />

XDR decoded properly. This can result from corruption of the packet over the<br />

network, or from an implementation bug causing the NFS client to improperly<br />

encode its arguments.<br />

Action<br />

If this message originates from a single client, investigate that machine for NFS client<br />

software bugs. If this message appears throughout a network, especially accompanied<br />

by other networking errors, investigate the network cabling and connectors.<br />

124 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


NFS server <strong>string</strong> not responding still<br />

trying<br />

Cause<br />

In most cases this very common message indicates that the system has requested a<br />

service from an NFS server that is either down or extremely slow to respond. In<br />

some cases this message indicates that the network link to this NFS server is broken,<br />

although usually that condition generates other error messages as well. In a few<br />

cases this message indicates NFS client set-up problems.<br />

Action<br />

Check the non-responding NFS server to see whether the machine needs repair or<br />

rebooting. Encourage your user community to report such problems quickly but only<br />

once.<br />

Should this message appear when booting a diskless client, make sure that the<br />

client’s /etc/hosts file and the network naming service (NIS, NIS+, or other<br />

/etc/hosts files on the network) have been updated.<br />

See Also<br />

For more information, see the chapter on NFS troubleshooting in the NFS<br />

Administration Guide.<br />

NFS server <strong>string</strong> ok<br />

Cause<br />

This message is the follow-up to the "NFS server not responding" error. It indicates<br />

that the NFS server is back in operation.<br />

Action<br />

When an NFS server first comes up, it is busy fulfilling client requests for a while. Be<br />

patient and wait for your client system to respond. Making many extraneous<br />

requests only further slows the NFS server response time.<br />

Alphabetical Message Listing 125


NFS <strong>string</strong> failed for server <strong>string</strong>: error<br />

int (<strong>string</strong>)<br />

Cause<br />

The failed NFS operation could be any one of the following: getattr, setattr, lookup,<br />

access, readlink, read, write, create, mkdir, symlink, mknod, remove, rmdir, rename,<br />

link, readdir, readdir+, fsstat, fsinfo, pathconf, or commit.<br />

See Also<br />

For more information on NFS, see the NFS Administration Guide.<br />

nfs umount: <strong>string</strong>: is busy<br />

Cause<br />

This message appears when the superuser attempts to unmount an active NFS<br />

filesystem. The busy point is the working directory of a process.<br />

Action<br />

Determine which shell (or process) on the workstation is currently located in the<br />

remotely mounted filesystem, and change ( cd(1)) out of that directory. Be wary of<br />

subshells (such as su(1M) shells) that could be in different directories while the<br />

parent shells remain in the NFS filesystem.<br />

NFS write error on host <strong>string</strong>: No space<br />

left on device.<br />

Cause<br />

This console message indicates that an NFS-mounted partition has filled up and<br />

cannot accept writing of new data. Unfortunately, software that attempts to<br />

overwrite existing files will usually zero out all data in these files. This is particularly<br />

destructive on NFS-mounted /home partitions.<br />

126 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

Find the user or process that is filling up the filesystem, and stop the out-of-control<br />

process as soon as you can. Then delete files as necessary to create more space on the<br />

filesystem (large core(4) files are good candidates for deletion). Have users write<br />

any modified files to local disk if possible. If this error occurs often, redistribute<br />

directories to ease demand on this partition.<br />

See Also<br />

For more information on disk usage, see the System Administration Guide, Volume II.<br />

If you are using the AnswerBook, "managing disk use" is a good search <strong>string</strong>.<br />

NFS write failed for server <strong>string</strong>: RPC:<br />

Timed out<br />

Cause<br />

This error can occur when a file system is soft-mounted, and server or network<br />

response time lags. Any data written to the server during this period could be<br />

corrupted.<br />

Action<br />

If you intend to write on a filesystem, never specify the soft mount option. Use the<br />

default hard mount for all the filesystems that are mounted read-write.<br />

See Also<br />

For more information, see the chapter on NFS troubleshooting in the NFS<br />

Administration Guide.<br />

Alphabetical Message Listing 127


NIS+ authentication failure<br />

Cause<br />

This is a Federated Naming Service message. The operation could not be completed<br />

because the principal making the request could not be authenticated with the name<br />

service involved.<br />

Action<br />

Run the nisdefaults(1) command to verify that you are identified as the correct<br />

NIS+ principal. Also check that the system has specified the correct public key source.<br />

See Also<br />

For more information, see the authentication and authorization overview in the NIS+<br />

and FNS Administration Guide.<br />

nis_cachemgr: Error in reading NIS<br />

cold start file :<br />

’/var/nis/NIS_COLD_START’<br />

Cause<br />

After installing patches 104331-04 and 103612-33, nis_cachemgr(1M) failed to<br />

come up. The symptoms are as follows during the reboot:<br />

Sep 11 16:34:00 nis_cachemgr: Error in reading NIS cold start file :<br />

’/var/nis/NIS_COLD_START’<br />

and nis_cachemgr(1M) isn’t there after logging in. Trussing nis_cachemgr(1M)<br />

showed that it is reading /var/nis/NIS_COLD_START and immediately reporting<br />

an error. Neither reinitializing the client nor copying NIS_COLD_START helps.<br />

Action<br />

This is a timing problem. Put a sleep(1) before the NIS+ initialization in<br />

/etc/init.d/rpc, after rpc.bind has been started. rpc.bind is slow about<br />

128 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


initializing and needs a few extra seconds to get going before nis_cachemgr(1M)<br />

starts bugging it.<br />

No buffer space available<br />

Cause<br />

An operation on a transport endpoint or pipe was not performed because the system<br />

lacked sufficient buffer space or because a queue was full. The target system<br />

probably ran out of memory or swap space. Any data written during this condition<br />

will probably be lost.<br />

Action<br />

To add more swap area, use the swap -a command on the target system.<br />

Alternatively, reconfigure the target system to have more swap space. As a general<br />

rule, swap space should be two to three times as large as physical memory.<br />

Technical Notes<br />

The symbolic name for this error is ENOBUFS, errno=132.<br />

No child processes<br />

Cause<br />

This message can appear when an application tries to communicate with cooperating<br />

process that do not exist.<br />

Action<br />

Restart the parent process so it can create the child processes again. If that doesn’t<br />

help, this could be the result of a programming error; contact the vendor or author of<br />

the program for an update.<br />

Alphabetical Message Listing 129


Technical Notes<br />

A wait(2) system call was executed by a process that had no existing or<br />

unwaited-for child processes. The child processes could have exited prematurely, or<br />

might never have been created.<br />

The symbolic name for this error is ECHILD, errno=10.<br />

No default media available<br />

Cause<br />

The volume manager issues this message if a user makes an eject(1) request when<br />

the drives contain no diskette or CD-ROM to eject.<br />

Action<br />

Insert a diskette or CD-ROM. If the volume manager is confused and there actually is<br />

a diskette or CD-ROM in a drive, run volcheck(1) to update the volume manager.<br />

If the system remains confused, try booting with the -r option to reconfigure devices.<br />

No directory! Logging in with home=/<br />

Cause<br />

The login(1) program could not find the home directory listed in the password file<br />

or NIS passwd(4) map, so it deposited the user in the root directory.<br />

Action<br />

Check that the user’s home directory is mounted and is owned by and accessible to<br />

that user. Perhaps the automounter tried to mount the home directory, but the NFS<br />

server did not respond quickly enough. Try listing the files in /home/username. If the<br />

NFS server responds to this request, have the user log out and log in again.<br />

It is possible that the automounter daemon is not running. Run the ps(1) command<br />

to see if automountd(1M) is present. If not, run the second command; if it appears<br />

to be wedged, run both these commands:<br />

130 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


# /etc/init.d/autofs stop<br />

# /etc/init.d/autofs start<br />

When the automounter daemon is running, verify that the /etc/auto_master file<br />

has a line like this:<br />

/home<br />

auto_home<br />

Verify that the /etc/auto_home file has a line like this:<br />

+auto_home<br />

These entries depend on the NIS auto_home map.<br />

It is also possible that the NFS server has not shared (exported) this /home directory,<br />

or that the NFS daemons on the server have disappeared.<br />

See Also<br />

For more information on NFS, see the NFS Administration Guide.<br />

No message of desired type<br />

Cause<br />

An attempt was made to receive a message of a type that does not exist on the<br />

specified message queue. See the msgsnd(2) and msgrcv(2) man pages for details.<br />

Action<br />

This indicates an error in the System V IPC message facility. Generally the message<br />

queue is empty or devoid of the desired message type, while IPC_NOWAIT is set.<br />

Technical Notes<br />

The symbolic name for this error is ENOMSG, errno=35.<br />

Alphabetical Message Listing 131


No recipients specified<br />

Cause<br />

This message comes from the mailx(1) command whenever a user doesn’t provide<br />

an address in the To: field.<br />

Action<br />

See the message "Recipient names must be specified" for details.<br />

No record locks available<br />

Cause<br />

No more record locks are available. The system lock table is full.<br />

Technical Notes<br />

The symbolic name for this error is ENOLCK, errno=46.<br />

Perhaps a process called fcntl(2) with the F_SETLK or F_SETLKW option, and the<br />

system maximum was exceeded. The system contains several different locking<br />

subsystems, including fcntl(2), the NFS lock daemon, and mail locking, all of<br />

which can produce this error.<br />

Action<br />

Try again later, when more locks might be available.<br />

No route to host<br />

Cause<br />

An operational error occurred because there was no route to the destination host, or<br />

because of status information returned by intermediate gateways or switching nodes.<br />

The returned status is not always sufficient to distinguish between a host that is<br />

down and a network that is down. See the "Network is unreachable" message.<br />

132 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

Check the network routers and switches to see if they are disallowing these packet<br />

transfers. If they are allowing all packet transfers, check network cabling and<br />

connections.<br />

Technical Notes<br />

The symbolic name for this error is EHOSTUNREACH, errno=148.<br />

No shell Connection closed<br />

Cause<br />

A user has attempted to a remote login to the system, and has a valid account name<br />

and password, but the shell specified for their account is not available on that<br />

system. For example, the seventh field could request the GNU Bourne-again shell<br />

/bin/bash, which does not exist on standard Solaris distributions.<br />

Action<br />

If you have a copy of the requested shell, become superuser and install the missing<br />

shell on that system. Otherwise, change the user’s password file entry (perhaps only<br />

in the NIS+ or NIS passwd(4) map) to specify an available shell such as /bin/csh<br />

or /bin/ksh.<br />

No space left on device<br />

Cause<br />

While writing an ordinary file or creating a directory entry, there was no free space<br />

left on the device. The disk, tape, or diskette is full of data. Any data written to that<br />

device during this condition will be lost.<br />

Action<br />

Remove unneeded files from the hard disk or diskette until there is space for all the<br />

data you are writing. It might be advisable to move some directories onto another<br />

Alphabetical Message Listing 133


filesystem and create symbolic links accordingly. When a tape is full, continue on<br />

another one, use a higher density setting, or obtain a higher-capacity tape.<br />

To create multi-volume tapes or diskettes, use the pax(1) or cpio(1) command;<br />

tar(1) is still limited to a single volume.<br />

Technical Notes<br />

The symbolic name for this error is ENOSPC, errno=28.<br />

No such device<br />

Cause<br />

An attempt was made to apply an operation to an inappropriate device, such as<br />

writing to a nonexistent device.<br />

Action<br />

Look in the /devices directory to see why this device does not exist, or why the<br />

program expects it to exist. The similar "No such device or address" message tends<br />

to indicate I/O problems with an existing device, whereas this message tends to<br />

indicate a device that does not exist at all.<br />

Technical Notes<br />

The symbolic name for this error is ENODEV, errno=19.<br />

No such device or address<br />

Cause<br />

This error can occur when a tape drive is off-line or when a device has been<br />

powered off or removed from the system.<br />

134 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

For tape drives, make sure the device is connected, powered on, and toggled online<br />

(if applicable). For disk and CD-ROM drives, check that the device is connected and<br />

powered on.<br />

With all SCSI devices, ensure that the target switch or dial is set to the number<br />

where the system originally mounted it. To inform the system of a change to the<br />

target device number, reboot using the -r (reconfigure) option.<br />

Technical Notes<br />

This message results from I/O to a special file’s subdevice that either does not exist<br />

or that exists beyond the limit of the device.<br />

The symbolic name for this error is ENXIO, errno=6.<br />

No such file or directory<br />

Cause<br />

The specified file or directory does not exist. Either the file name or path name was<br />

entered incorrectly.<br />

Action<br />

Check the file name and path name for correctness and try again. If the specified file<br />

or directory is a symbolic link, it probably points to a nonexistent file or directory.<br />

Technical Notes<br />

The symbolic name for this error is ENOENT, errno=2.<br />

no such map in server’s domain<br />

Cause<br />

A user or an application tried to look up something using Network Information<br />

Services (NIS), but NIS has no corresponding database for this request.<br />

Alphabetical Message Listing 135


Action<br />

Make sure the NIS map name is spelled correctly. To see a list of nicknames for the<br />

various NIS maps, run the ypcat -x command. To see a full list of the various NIS<br />

maps (databases), run the ypwhich -m command. If the NIS service were not<br />

running on the current machine, these commands would result in a "can’t<br />

communicate with ypbind" message.<br />

No such process<br />

Cause<br />

This process cannot be found. The process could have finished execution and<br />

disappeared, or it might still be in the system under a different numeric ID.<br />

Action<br />

Use the ps(1) command to check that the process ID you’re supplying is correct.<br />

Technical Notes<br />

No process corresponds to the specified process ID (PID), light-weight process ID, or<br />

thread_t.<br />

The symbolic name for this error is ESRCH, errno=3.<br />

No such user as <strong>string</strong>-- cron entries<br />

not created<br />

Cause<br />

A file exists in /var/spool/cron/crontabs for the specified user, but this user is<br />

not in /etc/passwd or the NIS passwd(4) map. The system cannot create<br />

cron(1M) entries for nonexistent users.<br />

136 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

To eliminate this message at boot time, remove the cron file for the nonexistent user,<br />

or rename it if the user’s login name has changed. If this is a valid user, create an<br />

appropriate password entry for this name.<br />

Not a data message<br />

Cause<br />

During a read(2), getmsg(2), or ioctl(2) I_RECVFD call to a STREAMS device,<br />

something has come to the head of the queue that can’t be processed. That something<br />

depends on the call: 1. read(2): control information or passed file descriptor; 2.<br />

getmsg(2): passed file descriptor; 3. ioctl(2): control or data information.<br />

Technical Notes<br />

The symbolic name for this error is EBADMSG, errno=77.<br />

Not a directory<br />

Cause<br />

A non-directory was specified where a directory is required, such as in a path prefix<br />

or as an argument to the chdir(2) call.<br />

Action<br />

Look at a listing of all the files in the current directory and try again, specifying a<br />

directory instead of a file.<br />

Technical Notes<br />

The symbolic name for this error is ENOTDIR, errno=20.<br />

Alphabetical Message Listing 137


Not a stream device<br />

Cause<br />

A putmsg(2) or getmsg(2) system call was attempted on a file descriptor that is<br />

not a STREAMS device.<br />

Technical Notes<br />

The symbolic name for this error is ENOSTR, errno=60.<br />

Not enough space<br />

Cause<br />

This message indicates that the system is running many large applications<br />

simultaneously, and has run out of swap space (virtual memory). It could also<br />

indicate that applications failed without freeing pages from the swap area. Swap<br />

space is an area of disk set aside to store portions of applications and data not<br />

immediately required in memory. Any data written during this condition will<br />

probably be lost.<br />

Action<br />

Reinstall or reconfigure the system to have more swap space. A general rule of<br />

thumb is that swap space should be two to three times as large as physical memory.<br />

Alternatively, use mkfile(1M) and swap(1M) to add more swap area. This<br />

example shows how to add 16 MB of virtual memory in the /usr/swap file (any<br />

filesystem with enough free space would work):<br />

# mkfile 16m /usr/swap<br />

# swap -a /usr/swap<br />

To make this automatic at boot time, add the following line to the /etc/vfstab file:<br />

/usr/swap - - swap - no -<br />

138 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

In calling the fork(2), exec(2), sbrk(2), or malloc(3C) routine, a program asked<br />

for more memory than the system could supply. This is not a temporary condition;<br />

swap space is a system parameter.<br />

The symbolic name for this error is ENOMEM, errno=12.<br />

not found<br />

Cause<br />

This message indicates that the Bourne shell could not find the program name given<br />

as a command.<br />

Action<br />

Check the form and spelling of the command line. If that looks correct, echo $PATH<br />

to see if the user’s search path is correct. When communications are garbled, it is<br />

possible to unset a search path to such an extent that only built-in shell commands<br />

are available. Here is a command to reset a basic search path:<br />

$ PATH=/usr/bin:/usr/ccs/bin:/usr/openwin/bin:.<br />

If the search path looks correct, check the directory contents along the search path to<br />

see if programs are missing or if directories are not mounted.<br />

NOTICE: vxvm: unexpected status on<br />

close<br />

Cause<br />

Everytime system boots (or is shut down), the message is displayed on the console.<br />

Sometimes, the following message is also displayed on the console and in the<br />

/var/adm/messages file:<br />

WARNING:<br />

/iommu@0,10000000/sbus@0,10001000/SUNW,soc@2,0/SUNW,pln@a0000000,74127a/ssd@4,2<br />

(continued)<br />

Alphabetical Message Listing 139


(Continuation)<br />

(ssd22):<br />

Error for Command: Error Level: Fatal<br />

Requested Block: 0 Error Block: 0<br />

Vendor: CONNER<br />

Serial Number: 93081LPT<br />

Sense Key: Aborted Command<br />

ASC: 0xb3 (), ASCQ: 0x0, FRU: 0x0<br />

WARNING:<br />

/iommu@0,10000000/sbus@0,10001000/SUNW,soc@2,0/SUNW,pln@a0000000,74127a/ssd@4,2<br />

(ssd22): ssd_synchronize_cache failed (5)<br />

Action<br />

In a High Availability system with NVRAM, this would be caused by unprocessed<br />

data in a NVRAM cache of the active logical host that has been down and come up<br />

later. Because of this, NVRAM should not be used in an HA system. The problem<br />

can be solved in this case by getting rid of the NVRAM on the HA system.<br />

In a non-HA system, this can also be caused by stale data in the NVRAM cache. (The<br />

example commands below assume the controller for the array is c1.) To fix for a<br />

non-HA system:<br />

1. Turn off all fast writes on this array and sync any remaining pending writes.<br />

# ssaadm fast_write -d c1<br />

# ssaadm sync_cache c1<br />

2. When you sync the fast writes to the array, all pending writes are physically made<br />

to the disks. Anything that is left in the cache is stale, and thus, it is safe to purge it.<br />

Run the command:<br />

# ssaadm purge c1<br />

3. Turn the fast writes for the disks back on. This command MAY be different on<br />

your system, depending on the disks on which you want fast writes enabled and the<br />

types of fast writes you want:<br />

# ssaadm fast_write -s -e c1<br />

140 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


NOTICE: /<strong>string</strong>: out of inodes<br />

Cause<br />

The filesystem specified after the first colon probably contains many small files,<br />

exceeding the per-filesystem limit for inodes (file information nodes).<br />

Action<br />

If many small files were created unintentionally, removing them resolves the problem.<br />

Otherwise, follow these steps to increase filesystem capacity for small files. Make<br />

several backup copies of the filesystem on different tapes (for safety), then bring the<br />

machine down to single-user mode. Use the newfs(1M) command with the -i<br />

option to increase inode density for this filesystem. Here is an example:<br />

# newfs -i 1024 /dev/rdsk/partition<br />

Finally, restore the filesystem from a backup tape. Note that increasing the inode<br />

density slightly reduces total filesystem capacity.<br />

Not login shell<br />

Cause<br />

This message results when a user tries to logout(1) from a shell other than the one<br />

started at login time.<br />

Action<br />

To quit a non-login shell, use the exit(1) command. Continue doing so until you<br />

have logged out.<br />

See Also<br />

For more general information on the login shell, see the section on customizing your<br />

work environment in the Solaris Advanced User’s Guide.<br />

Alphabetical Message Listing 141


Not on system console<br />

Cause<br />

A user tried to login(1) to a system as the superuser (uid=0, which is not<br />

necessarily root) from a terminal other than the console.<br />

Action<br />

Login to that system as a normal user, then run su(1M) to become superuser. To<br />

allow superuser logins from any terminal, comment out the CONSOLE line in<br />

/etc/default/login (this is not recommended for security reasons).<br />

Not owner<br />

Cause<br />

Either an ordinary user tried to do something reserved for the superuser, or the user<br />

tried to modify a file in a way restricted to the file’s owner or to the superuser.<br />

Action<br />

Switch user to root and try again.<br />

Technical Notes<br />

The symbolic name for this error is EPERM, errno=1.<br />

Not supported<br />

Cause<br />

This version of the system does not support the feature requested, although future<br />

versions of the system might provide support.<br />

142 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

This is generally not a system message from the kernel, but an error returned by an<br />

application. Contact the vendor or author of the application for an update.<br />

Technical Notes<br />

The symbolic name for this error is ENOTSUP, errno=48.<br />

No utmpx entry<br />

Cause<br />

During login file system full errors are seen and login fails with the message<br />

No utmpx entry.<br />

This is caused by a full file system so that the system has no space to write its utmpx<br />

(login info) entry.<br />

Action<br />

To get around this condition the system must be booted into single user mode. Then<br />

clear (do not delete) the files: /var/adm/utmp and /var/adm/utmpx. This can be<br />

done by typing:<br />

#cat /dev/null > /var/adm/utmp<br />

#cat /dev/null > /var/adm/utmpx<br />

These commands zero out the files but keep it with the correct permissions.<br />

In some cases after clearing these files the /var filesystem may still be full. In this<br />

case type:<br />

du -askd /var |sort -nr |more<br />

This will give you a listing of the files from largest to smallest in the /var filesystem.<br />

To create space you can zero the files: /var/cron/log, /var/spool/lp/logs, and<br />

/var/adm/messages. You can also check /.wastebasket for large files to delete.<br />

Alphabetical Message Listing 143


6/04/98 7:27:54 nsrck: SYSTEM error,<br />

more space needed to compress [client]<br />

index, 8.1 MB required<br />

Cause<br />

In networker, cannot use the Remove Oldest Cycle feature because the /nsr<br />

filesystem is too full to perform a remove. An error message appears in the console<br />

window indicating that the file system is full.<br />

Action<br />

1. Stop the networker daemons so that some of the indices can be moved. In<br />

SunOS 5, use /etc/init.d/networker stop. In SunOS 4, use<br />

ps -ef | grep nsr and kill(1) the processes.<br />

2. Find a filesystem with enough space to move one of the client’s indices. Only one<br />

of the client’s indices should be moved, not the networker server’s index. To find<br />

the size of a client’s index, go to /nsr/index/client name/db and list the contents<br />

using ls -l. The data base file can be very large (possibly over 500 MB).<br />

3. Move the contents of a client’s index to the other filesystem and check that /nsr<br />

has freed the space to use. It may be necessary to unmount and remount /nsr, or<br />

even to reboot to designate the space freed by the move as available.<br />

4. Once the space is available, restart the daemons.<br />

5. Go into nwadmin. Under Clients--Indexes, select a client and use<br />

Remove Oldest Cycle to free more space.<br />

Use Reclaim Space to reclaim the space from the removed cycles. After a few of<br />

the old cycles have been removed, there should be enough space in the filesystem to<br />

move the removed client’s index back.<br />

6. Stop the daemons, and move the client’s index back to /nsr/index/clientname.<br />

7. Restart the daemons. Remove oldest cycles for the client that was just moved.<br />

Tweaking of the browse policy and retention policy may be necessary to prevent this<br />

situation from happening in the future.<br />

Other, long term solutions are to add more hard disk and run growfs or move /nsr<br />

to a drive with more space on it.<br />

144 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"O"<br />

Object is remote<br />

Cause<br />

This error occurs when users try to share a resource that is not on the local machine,<br />

or try to mount/unmount a device or path name that is on a remote machine.<br />

Technical Notes<br />

The symbolic name for this error is EREMOTE, errno=66.<br />

ok<br />

Cause<br />

This is the OpenBoot PROM monitor prompt. From this prompt, you can boot the<br />

system (from disk, CD-ROM, or net), or you can use the go command to continue<br />

where you left off.<br />

Action<br />

If you suddenly see this prompt, look at the messages above it to see if the system<br />

crashed. If no other messages appear, and you just typed Stop-A or plugged in a<br />

new keyboard, type go to continue. You might need to Refresh the window system<br />

from its Workspace Menu.<br />

Technical Notes<br />

Never invoke sync from the ok prompt without first running the fsck(1M) command,<br />

especially if the filesystem has changed.<br />

Alphabetical Message Listing 145


Operation already in progress<br />

Cause<br />

An operation was attempted on a non-blocking object that already had an operation<br />

in progress.<br />

Technical Notes<br />

The symbolic name for this error is EALREADY, errno=149.<br />

Operation canceled<br />

Cause<br />

The associated asynchronous operation was canceled before completion.<br />

Technical Notes<br />

The symbolic name for this error is ECANCELED, errno=47.<br />

operation failed [error 185], unknown<br />

group error 0, <strong>string</strong><br />

Cause<br />

When you use admintool to add a user to a newly-created group, admintool issues this<br />

error.<br />

Action<br />

Apply patch 101384-05 to fix bug ID 1151837 and to provide a workaround for bug<br />

ID 1153087.<br />

146 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Operation not applicable<br />

Cause<br />

This error indicates that no system support exists for some function that the<br />

application requested.<br />

Action<br />

Ask the system vendor for an upgrade, or contact the vendor or author of the<br />

application for an update.<br />

Technical Notes<br />

This message indicates that no system support exists for an operation. Many<br />

modules set this error when a programming function is not yet implemented. If you<br />

are writing a program that produces this message while calling a system library, try<br />

to find and use an alternative library function. Future versions of the system might<br />

support this operation; check system release notes for further information.<br />

The symbolic name for this error is ENOSYS, errno=89.<br />

Operation not supported on transport<br />

endpoint<br />

Cause<br />

For example, trying to accept a connection on a datagram transport endpoint.<br />

Technical Notes<br />

The symbolic name for this error is EOPNOTSUPP, errno=122.<br />

Alphabetical Message Listing 147


Operation now in progress<br />

Cause<br />

An operation that takes a long time to complete (such as a connect) was attempted on<br />

a non-blocking object.<br />

Technical Notes<br />

The symbolic name for this error is EINPROGRESS, errno=150.<br />

/opt/bin/jws: /solaris/bin/locate_dirs:<br />

not found<br />

Cause<br />

This error message occurred when the customer used the link from /opt/bin/jws<br />

to /opt/SUNWjws/JWS/sparc-S2/bin/jws, to start Java Workshop. Typing in the<br />

full pathname works fine, typing jws gives the error.<br />

Action<br />

This happens because /opt/bin/jws is not<br />

/opt/SUNWjws/JWS/sparc-S2/bin/jws, which a script that runs another script:<br />

$_SS_JWS_HOME/solaris/bin/locate_dirs.<br />

So whatever /opt/bin/jws is, it is not setting $_SS_JWS_HOME correctly. Take that<br />

out of your path and put /opt/SUNWjws/JWS/sparc-S2/bin/jws in your path<br />

so which jws returns: /opt/SUNWjws/JWS/sparc-S2/bin/jws.<br />

Option not supported by protocol<br />

Cause<br />

A bad option or level was specified when getting or setting options for a protocol.<br />

148 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

The symbolic name for this error is ENOPROTOOPT, errno=99.<br />

out of memory<br />

Cause<br />

Hundreds of different programs can produce this message when the system is<br />

running many large applications simultaneously. This message usually means that<br />

the system has run out of swap space (virtual memory).<br />

Action<br />

See the message "Not enough space" for details. Any data written during this<br />

condition will probably be lost.<br />

Out of stream resources<br />

Cause<br />

During a STREAMS open, either no STREAMS queues or no STREAMS head data<br />

structures were available. This is a temporary condition; one may recover from it if<br />

other processes release resources.<br />

Technical Notes<br />

The symbolic name for this error is ENOSR, errno=63.<br />

overlapping swap volume<br />

Cause<br />

After creating volumes in rootdg to be used as additional swap and adding these to<br />

the /etc/vfstab file, an error message is displayed at boot time complaining about<br />

overlapping swap volumes.<br />

Alphabetical Message Listing 149


Action<br />

Change the names of these volumes to read swap1, swap2, etc...<br />

If you still get this message after making the above change, edit the<br />

/sbin/swapadd script. Find the line:<br />

c=‘$SWAP -l | grep -c ’\\’‘<br />

and change it to<br />

=‘$SWAP -l | grep -c ’’${special}’’‘<br />

Also see bug number 1215062 for more information on this behavior.<br />

"P"<br />

Package not installed<br />

Cause<br />

This error occurs when users attempt to use a system call from a package which has<br />

not been installed.<br />

Technical Notes<br />

The symbolic name for this error is ENOPKG, errno=65.<br />

panic -boot: Could not mount<br />

filesystem<br />

Cause<br />

The primary problem is that jumpstart gives the following error:<br />

2ec00 RPC: Can’t decode result.<br />

whoami RPC call failed with rpc status: 2<br />

(continued)<br />

150 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


(Continuation)<br />

panic - boot: Could not mount filesystem.<br />

program terminated<br />

ok<br />

Normally this is a result of the bootparams not being able to get to the install<br />

image.<br />

The second problem is that other users have had the same error message, with the<br />

additional message:<br />

’Timeout waiting for ARP/RARP packet...’<br />

Action<br />

To solve the first problem:<br />

1. Check how the dfstab(4) (/etc/dfs/dfstab on the install image NFS server)<br />

looks:<br />

share -F nfs -o ro,anon=o /jumpstart-dir<br />

2. Run share(1M) command on the installed image NFS server, to make sure it is<br />

shared properly.<br />

3. Check /etc/bootparams file on the net install server. Look for entries with<br />

incorrect boot path.<br />

4. Make sure that /usr/sbin/rpc.bootparamd is running on the boot server. If<br />

necessary, kill and restart it.<br />

5. Check /etc/ethers on the boot server for duplicate or conflicting entries<br />

6. At OK prompt, run test net /test-net and/or watch net /watch-net to test the<br />

network connectivity<br />

A workaround for the second problem is to check the nsswitch.conf(4) file. If<br />

some of the entries point to NIS such as:<br />

rpc nis files<br />

hosts nis files<br />

ethers nis files<br />

bootparams files<br />

nis<br />

Alphabetical Message Listing 151


change all of these entries to files first:<br />

rpc files nis<br />

hosts files nis<br />

ethers files nis<br />

bootparams files nis<br />

Note - You may have to manually update these files if they do not contain info on<br />

the client machine you are trying to jumpstart.<br />

Then remove the client with rm_install_client(1M), remove the contents of<br />

tftpboot, and re-add the client:<br />

add_install_client -c /jumpstart-dir/profiles ’client name’ ’arch’<br />

panic: mutex_adaptive_exit<br />

Cause<br />

CD-ROM boot gets error:panic: mutex_adaptive_exit Documented above under<br />

message "panic_mutex". Same SRDB ID.<br />

Panic<br />

Cause<br />

A system panics and crashes when a program exercises an operating system bug.<br />

Although the crash might seem unfriendly to a user, the sudden stop actually<br />

safeguards the system and its data from further corruption.<br />

Along with bringing the operating system to a stop, the panic routine copies the<br />

memory contents in use to a dump device, recording critical information about the<br />

current state of the CPU from which the panic routine was called.<br />

Because the primary swap device is usually the default dump device, the primary<br />

swap device should be large enough to hold a complete image of memory. The<br />

system tries to reboot after the memory image is saved.<br />

If the system does not reboot successfully, consider these possibilities:<br />

1. Catastrophic hardware failure, such as faulty memory or a crashed disk;<br />

152 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


2. Major kernel configuration faults, such as a buggy device driver;<br />

3. Major kernel tuning errors, such as a too-large value for maxusers;<br />

4. Data corruption, including corruption of the operating system files;<br />

5. Manual intervention is needed, as when fsck(1M) expects answers to its queries.<br />

Action<br />

To find out why a system crashed, you can<br />

1. Look in the /var/adm/message* log files;<br />

Action<br />

Of these methods, using savecore(1M) is the most informative. The<br />

savecore(1M) command transfers the system crash dump image generated by the<br />

panic routine from the dump device to a file system. The image can then be analyzed<br />

with a debugger such as adb(1).<br />

See Also<br />

Correctly setting up savecore(1M) and interpreting its results can be difficult. For<br />

more information about debugging system panics, refer to Panic! UNIX System<br />

Crash Dump Analysis by Chris Drake and Kimberley Brown (ISBN 0-13-149386-8).<br />

PARTIALLY ALLOCATED INODE I=int CLEAR?<br />

Cause<br />

During phase 1, fsck(1M) found that the specified inode was neither allocated nor<br />

unallocated. The reason is probably that the system crashed in the middle of a<br />

sync(2) or write(2) operation.<br />

Action<br />

Should you answer yes to this question, "UNALLOCATED" messages might result<br />

during phase 2, if any directory entries point to this inode. If you are being careful,<br />

exit fsck(1M) and run ncheck(1M) (specifying the inode number after the -i<br />

option) to determine which file or directory is involved here. You might be able<br />

restore this file or directory from another system. It is also possible that fsck(1M)<br />

will copy this file to the lost+found directory in a later phase.<br />

Alphabetical Message Listing 153


See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

passwd: Changing password for <strong>string</strong><br />

Cause<br />

If you put the following lines into /etc/nsswitch.conf<br />

passwd: compat<br />

passwd_compat: nis<br />

Then when you run passwd, it fails as follows:<br />

server1% passwd<br />

passwd: Changing password for khh<br />

server1%<br />

NOTE: passwd exits before you get the opportunity to enter a password.<br />

Action<br />

If you read the man page for passwd very carefully, you will see the following:<br />

If all requirements are met, by default, the passwd command will consult<br />

/etc/nsswitch.conf to determine in which repositories to perform password<br />

update. It searches the passwd(4) and passwd_compat entries. The sources<br />

(repositories) associated with these entries will be updated. However, the password<br />

update configurations supported are limited to the following 5 cases. Failure to<br />

comply with the configurations will prevent users from logging onto the system.<br />

o passwd: files<br />

o passwd: files nis<br />

o passwd: files nisplus<br />

o passwd: compat (==> files nis)<br />

o passwd: compat (==> files nisplus)<br />

passwd_compat: nisplus<br />

154 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


NOTE: It does *not* say that you can use the line: passwd_compat: nis. If you<br />

conform exactly to what’s written in the man page, then passwd(1) works.<br />

passwd.org_dir: NIS+ servers<br />

unreachable<br />

Cause<br />

This is the first of three messages that an NIS+ client prints when it cannot locate an<br />

NIS+ server on the network.<br />

Action<br />

See the message "hosts.org_dir: NIS+ servers unreachable" for details.<br />

Password does not decrypt secret key<br />

for unix.uid@<strong>string</strong><br />

Cause<br />

This message appears at login time when a user’s password is not identical to the<br />

user’s keylogin(1) network password. When a system is running NIS+, the login<br />

program first performs UNIX authentication, and then attempts a keylogin(1) for<br />

secure RPC authentication.<br />

Action<br />

To gain credentials for secure RPC, users can run keylogin(1) (after login) and<br />

type in their secret key. To stop this message from appearing at login time, users can<br />

run the chkey -p command and set their network password to be the same as their<br />

NIS+ password. If a user doesn’t remember the network password, the system<br />

administrator should delete and re-create the user’s credentials table entry so the<br />

user can establish a new network password with chkey(1).<br />

Alphabetical Message Listing 155


Permission denied<br />

Cause<br />

An attempt was made to access a file in a way forbidden by the protection system.<br />

Action<br />

Check the ownership and protection mode of the file (with a long listing from the<br />

ls -l command) to see who is allowed to access the file. Then change the file or<br />

directory permissions as needed.<br />

Technical Notes<br />

The symbolic name for this error is EACCES, errno=13.<br />

Please specify a recipient.<br />

Cause<br />

With mailtool(1), this message comes up in a dialog box whenever a user tries to<br />

deliver a message with no address in the To: field.<br />

Action<br />

See the message "Recipient names must be specified" for details.<br />

Protocol error<br />

Cause<br />

Some protocol error occurred. This error is device specific, but is generally not<br />

related to a hardware failure.<br />

Technical Notes<br />

The symbolic name for this error is EPROTO, errno=71.<br />

156 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


protocol error, <strong>string</strong> closed connection<br />

Cause<br />

rlogin(1) fails on SunOS machine.<br />

Action<br />

1. Check the permissions on in.rlogind on the machine you are trying to connect to.<br />

The permissions should look like this:<br />

-rwxr-xr-x 1 root staff 16384 Jan 20 1994 /usr/etc/in.rlogind<br />

2. Check the login line in the /etc/inetd.conf file. It should look like the following:<br />

login stream tcp nowait root /usr/etc/in.rlogind in.rlogind<br />

3. Check /etc/passwd to see if an invalid login shell has been substituted in the<br />

entry for the login id.<br />

Protocol family not supported<br />

Cause<br />

The protocol family has not been configured into the system or no implementation<br />

for it exists. Used for the Internet protocols.<br />

Technical Notes<br />

The symbolic name for this error is EPFNOSUPPORT, errno=123.<br />

Alphabetical Message Listing 157


Protocol not supported<br />

Cause<br />

The requested networking protocol has not been configured into the system, or no<br />

implementation for it exists. (A protocol is a formal description of the messages to be<br />

exchanged and the rules to be followed when systems exchange information.)<br />

Action<br />

Verify that the protocol is in the /etc/inet/protocols file and in the NIS<br />

protocols map, if applicable. If the protocol is not listed, and you want to permit its<br />

use, configure the protocol as documented or as required.<br />

Technical Notes<br />

The symbolic name for this error is EPROTONOSUPPORT, errno=120.<br />

Protocol wrong type for socket<br />

Cause<br />

This message indicates either an application programming error, or badly configured<br />

protocols.<br />

Action<br />

Make sure that the /etc/protocols file corresponds number-for-number with the<br />

NIS protocols(4) map. It it does, ask the vendor or author of the application for<br />

an update.<br />

Technical Notes<br />

A protocol was specified that does not support the semantics of the socket type<br />

requested. This amounts to a request for an unsupported type of socket. Look at the<br />

source code that made this socket request and check that it requested one of the<br />

types specified in /usr/include/sys/socket.h.<br />

The symbolic name for this error is EPROTOTYPE, errno=98.<br />

158 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


"Q"<br />

quotactl: open Is a directory<br />

Cause<br />

Using edquota to set users limits the command displays the error.<br />

edquota updates all quota files that are on a mounted filesystem. So, a directory<br />

named quotas causes it to fail.<br />

Action<br />

In one of the mounted filesystems, there is a directory named quotas. To fix the<br />

problem, move the directory off the mounted filesystem, rename it, or delete it.<br />

For example: If you have /usr/quotas/old_info The directory /usr/quotas is<br />

going to cause edquota to fail. Either move /usr/quotas to /usr/old_quotas<br />

or delete the directory.<br />

"R"<br />

Read error from network: Connection<br />

reset by peer<br />

Cause<br />

This message appears when a user logs remotely into a machine that crashes or s<br />

rebooted during the rlogin(1) or rsh(1) session. Any data changes that were not<br />

saved are probably lost. Sometimes this message appears only when the user types<br />

something, even though the system went down hours before.<br />

Action<br />

Try to rlogin(1) again, perhaps after waiting a few minutes for the system to reboot.<br />

Alphabetical Message Listing 159


Read-only file system<br />

Cause<br />

Files and directories on filesystems that are mounted read-only cannot be changed.<br />

Action<br />

If you only modify these files and directories occasionally, rlogin(1) to the servers<br />

from which the filesystems are mounted and change the files or directories there. If<br />

you change these files and directories frequently, mount(1M) the filesystems read/<br />

write.<br />

Technical Notes<br />

The symbolic name for this error is EROFS, errno=30.<br />

rebooting...<br />

Cause<br />

This message appears on the console to indicate that the machine is booting, either<br />

after the superuser issued a reboot(1M) command, or after a system panic if the<br />

EEPROM’s watchdog-reboot? variable is set to true.<br />

Action<br />

Allow the machine to boot itself. In case of a system panic, look above this message<br />

for other indications of what went wrong.<br />

Recipient names must be specified<br />

Cause<br />

Somebody sent mail without a valid recipient in the To: field, so sendmail(1M)<br />

could not deliver the mail message. Using mail(1), the recipient’s address might<br />

have been specified using spaces or non-alphanumeric characters. The mailtool(1) and<br />

160 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


mailx(1) commands try to prevent this by issuing "Please specify a recipient" or "No<br />

recipients specified" messages instead. If there is at least one valid recipient, each<br />

invalid recipient address will generate a "User unknown" message.<br />

Action<br />

Look in the sender’s dead.letter file for the automatically saved message, and<br />

have the originator send it again, this time specifying a recipient.<br />

See Also<br />

For more information about sendmail(1M), see the Mail Administration Guide.<br />

Reset tty pgrp from int to int<br />

Cause<br />

The C shell sometimes issues this message when it clears away the window process<br />

group after the user exits the window system. This can happen when the window<br />

system doesn’t clean up after itself.<br />

Action<br />

Proceed with your work. This message is only informational.<br />

Resource temporarily unavailable<br />

Cause<br />

This indicates that the fork(2) system call failed because the system’s process table<br />

is full, or that a system call failed because of insufficient memory or swap space. It is<br />

also possible that a user is not allowed to create any more processes.<br />

Action<br />

Simply waiting often gives the system time to free resources. However if this<br />

message occurs often on a system, reconfigure the kernel and allow more processes.<br />

Alphabetical Message Listing 161


To increase the size of the process table in Solaris 2, increase the value of maxusers<br />

in the /etc/system file. The default maxusers value is the amount of main<br />

memory in MB, minus 2.<br />

If one user is not allowed to create any more processes, that user has probably<br />

exceeded the memory size limit; see the limit(1) man page for details.<br />

Technical Notes<br />

The symbolic name for this error is EAGAIN, errno=11.<br />

Restartable system call<br />

Cause<br />

Interrupted system call should be restarted.<br />

Technical Notes<br />

The symbolic name for this error is ESTART, errno=91.<br />

Result too large<br />

Cause<br />

This is a programming error or a data input error.<br />

Action<br />

Ask the program’s author to fix this condition.<br />

Technical Notes<br />

This indicates an attempt to evaluate a mathematical programming function at a<br />

point where its value would overflow or underflow. The value of a programming<br />

function in the math package (3M) is not representable within machine precision.<br />

This could occur after floating point overflow or underflow (either single or double<br />

precision), or after total loss of numeric significance in Bessel functions.<br />

162 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


This message can indicate "Result too small" in the case of floating point underflow.<br />

To help pinpoint a program’s math errors, use the matherr(3M) facility.<br />

The symbolic name for this error is ERANGE, errno=34.<br />

rlogin: no directory! connection<br />

closed<br />

Cause<br />

When user tries to remotely login to a machine, he gets the error.<br />

The machine that the customer was trying to rlogin(1) to had permissions of 700<br />

on its root directory. The permissions on root should be 755.<br />

Once the user changed the rootpermissions to 755, he was able to get farther when<br />

attempting a rlogin, but it still failed:<br />

Last login: Fri Aug 29 10:24:43 from machinename<br />

no shell<br />

connection closed<br />

Action<br />

The machine that the user was trying to rlogin into had the permissions set to 700 on<br />

both the root and /usr/bin directories. For both directories, the permissions<br />

should be 775. Once the user changed the permissions to 775, rlogin(1) was<br />

successful.<br />

Another possibility is to check the user’s >passwd(1) entry in the NIS/NIS+ map. A<br />

login shell such as /usr/dist/exe/tcsh or /net/lab/.../csh could cause the<br />

failure because of NFS mount permission.<br />

Alphabetical Message Listing 163


mdir: <strong>string</strong>: Directory not empty<br />

Cause<br />

The rmdir(1) command can remove empty directories only. The directory whose<br />

name appears after the first colon in the message still contains some files or<br />

directories.<br />

Action<br />

Use rm(1) instead of rmdir(1). To remove this directory and everything underneath<br />

it, use the rm -ir command to recursively descend the directory, being asked if you<br />

want to delete each element. To remove the directory and all its contents without<br />

being asked for approval, use the rm -r command.<br />

ROOT LOGIN /dev/console<br />

Cause<br />

This syslog message indicates that someone has logged in as root on the system<br />

console.<br />

Action<br />

If you have just logged in as root, don’t worry. If this is not you, consider the<br />

possibility of a security breach. The best site-wide policy is for all system<br />

administrators to su(1M) instead of logging in as root.<br />

ROOT LOGIN /dev/pts/int FROM <strong>string</strong><br />

Cause<br />

This syslog message indicates that someone has logged in remotely as root on a<br />

pseudo-terminal from the system specified after the FROM keyword.<br />

164 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

For security reasons, it is a bad idea to allow root logins from anywhere other than<br />

the console. To restrict superuser logins to the console, remove the comment from the<br />

CONSOLE line in /etc/default/login.<br />

RPC: Program not registered<br />

Cause<br />

Check the rpc.bynumber NIS map.<br />

rx framing error<br />

Cause<br />

Usually this error indicates a hardware problem.<br />

Action<br />

Check the Ethernet cabling and connectors to locate a problem.<br />

Technical Notes<br />

A framing error occurs when the Ethernet I/O driver receives a non-integral unit of<br />

octets, such as 63 bytes and then 3 bits. (Ethernet specifies the use of octets.) Framing<br />

errors are caused by corruption of the starting or ending frame delimiters. These can<br />

be corrupted by some violation of the encoding scheme.<br />

Framing errors are a subset of CRC errors, which are usually caused by anomalies on<br />

the physical media. An "alignment/framing error" is a type of CRC error where octet<br />

boundaries do not line up.<br />

Alphabetical Message Listing 165


"S"<br />

save: SYSTEM error, Arg list too long<br />

Cause<br />

save fails with the error. The cause of this error is that the data base (index) file for<br />

the client is greater than 2 Gbytes. With Solaris 2.6 and SBU 5.0.1 this is no longer a<br />

problem.<br />

Action<br />

However, with earlier versions of Solaris you need to bring up nwadmin -> indexes<br />

-> select appropriate client -> select appropriate fs -> remove oldes cycle -> reclaim<br />

space.<br />

This may have to be repeated a few times to reclaim enough space. The indexes can<br />

be recreated later if necessary by using scanner.<br />

SCSI bus DATA IN phase parity error<br />

Cause<br />

The most common cause of this problem is unapproved hardware. Some SCSI<br />

devices for the PC market do not meet the high I/O speed requirements for the<br />

UNIX market. Other possible causes of this problem are improper cabling or<br />

termination, and power fluctuations. Data corruption is possible but unlikely to<br />

occur, because this parity error prevents data transfer.<br />

Action<br />

Check that all SCSI devices on the bus are Sun-approved hardware. Then verify that<br />

all cables measure no longer than six meters, total, and that all SCSI connections are<br />

properly terminated. If power fluctuations are occurring, invest in an uninterruptible<br />

power supply.<br />

166 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


SCSI transport failed: reason ’reset’<br />

Cause<br />

This message indicates that the system sent data over the SCSI bus, but the data<br />

never reached its destination because of a SCSI bus reset. The most common cause of<br />

this condition is conflicting SCSI targets.¤ Data corruption is possible but unlikely to<br />

occur, because this failure prevents data transfer.<br />

Action<br />

Verify that all cables measure no longer than six meters, total, and that all SCSI<br />

connections are properly terminated. If power surges are a problem, acquire a surge<br />

suppressor or uninterruptible power supply.<br />

A machine’s internal disk drive is usually SCSI target 3. Make sure that external and<br />

secondary disk drives are targeted to 1, 2, or 0, and do not conflict with each other.<br />

Also make sure that tape drives are targeted to 4 or 5, and CD drives to 6, avoiding<br />

any conflict with each other or with disk drives. If the targeting of the internal disk<br />

drive is in question, power off the machine, remove all external drives, turn the<br />

power on, and from the PROM monitor run the probe-scsi-all or probe-scsi<br />

command.<br />

If SCSI device targeting is acceptable, memory configuration could be the problem,<br />

especially for machines with the sun4c architecture. Ensure that high-capacity<br />

memory chips (such as 4MB SIMMs) are in lower banks, while lower-capacity<br />

memory chips (such as 1MB SIMMs) are in the upper banks.<br />

SPARC systems do not always support third party CD-ROM drives, and may<br />

generate a similar "unknown vendor" error message. Check with the CD-ROM<br />

vendor for specific configuration requirements.<br />

Some third-party disk drives have a read-ahead cache that interferes with Solaris<br />

device drivers. Make sure that any existing read-ahead cache facility is turned off.<br />

See Also<br />

¤ For more information on SCSI targets, see the section on device naming<br />

conventions in the Solaris 1.x to Solaris 2.x Transition Guide. If you are using the<br />

AnswerBook, "scsi targets" is a good search <strong>string</strong>.<br />

Alphabetical Message Listing 167


Security exception on host <strong>string</strong>. USER<br />

ACCESS DENIED.<br />

Cause<br />

When trying to create a user via Adminsuite, placing the home directory on a<br />

system remote from the nisplus server, customer gets error message:<br />

Security exception on host hostname. USER ACCESS DENIED.<br />

The user identity (555)username was received, but that user<br />

is not authorized to execute the requested functionality<br />

on this system. Is this user a member of an appropriate<br />

security group on this system ?<br />

(Function: class directory method create_dir)<br />

User can rsh(1) to the remote machine and create a home directory on the system.<br />

Action<br />

The user was not in the sysadmin group NIS+ tables.<br />

# niscat group.org_dir | grep sysadmin<br />

sysadmin::14:<br />

Add the username to the sysadmin group.<br />

Segmentation Fault<br />

Cause<br />

Segmentation faults usually result from programming error. This message is usually<br />

accompanied by a core dump, except on read-only filesystems.<br />

Action<br />

To see which program produced a core(4) file, run either the file(1) command or<br />

the adb(1) command. The following examples show the output of the file(1) and<br />

adb(1) commands on a core file from the dtmail program.<br />

168 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


$ file core<br />

core: ELF 32-bit MSB core file SPARC Version 1, from ‘dtmail’<br />

$ adb core<br />

core file = core -- program ‘dtmail’<br />

SIGSEGV 11: segmentation violation<br />

^D (use Control-d to quit the adb rogram)<br />

Ask the vendor or author of this program for a debugged version.<br />

Technical Notes<br />

A process has received a signal indicating that it attempted to access an area of<br />

memory that is protected or that does not exist. The two most common causes of<br />

segmentation faults are attempting to dereference a null pointer or indexing past the<br />

bounds of an array.<br />

sendmail[]: can’t lookup data via name<br />

server "dns" or<br />

sendmail[]: can’t lookup data via name<br />

server "nis"<br />

Cause<br />

The following entry in the /etc/nsswitch.conf file:<br />

sendmailvars: dns nis files causes the messages to appear in the console<br />

window.<br />

Action<br />

The sendmailvars database can only be used with local files and/or NIS+. So, if<br />

you do not have this database setup, the default sendmailvars entry should look<br />

as follows in the /etc/nsswitch.conf file:<br />

sendmailvars: files<br />

Alphabetical Message Listing 169


sendmail[int]: NOQUEUE: SYSERR: net hang<br />

reading from <strong>string</strong><br />

Cause<br />

This is a sendmail(1M) message that appears on the console and in the log file<br />

/var/adm/messages. If this message occurs once for a particular user, it is possible<br />

that a mail message from this user ends with a partial line (having no terminating<br />

newline character). If this message appears frequently or at busy times, especially<br />

along with other networking errors, it could indicate network problems.<br />

Action<br />

Check the user’s mail spool file to see if a message ends without a newline character.<br />

If so, talk with the user and determine how to prevent the problem from occurring<br />

again. If these messages are the result of network problems, you could try moving<br />

the mail spool directory to another machine with a faster network interface.<br />

Technical Notes<br />

During the SMTP receipt of DATA phase, a message-terminating period on a line of<br />

its own never arrived, so sendmail(1M) timed out and produced this error.<br />

Service wouldn’t let us acquire<br />

selection<br />

Cause<br />

This message indicates that the OpenWindows selection service failed to seize the<br />

requested selection from /tmp/winselection. Some diagnostics follow: the<br />

requested selection could be 0 for unknown, 1 for caret, 2 for primary, 3 for<br />

secondary, or 4 for clipboard. The result could be 0 for failure, 2 for nonexistent, 3 for<br />

didn’t have, 4 for wrong rank, 5 for continued, 6 for cancelled, or 7 for unrecognized.<br />

170 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


setmnt: Cannot open /etc/mnttab for<br />

writing<br />

Cause<br />

The system is having problems writing to /etc/mnttab. The filesystem containing<br />

/etc may be mounted read-only, or is not mounted at all.<br />

Action<br />

Check that this file exists and is writable by root. If so, ensure that the /etc<br />

filesystem has been mounted, and is mounted read-write rather than read-only.<br />

share_nfs: /home: Operation not<br />

applicable<br />

Cause<br />

This message usually indicates that the system has a local filesystem mounted on<br />

/home, which is where the automounter usually mounts users’ home directories.<br />

Action<br />

When a system is running the automounter, do not mount local filesystems on the<br />

/home directory. Mount them on another directory, such as /disk2, which on most<br />

systems you will have to create. You could also change the automounter auto_home<br />

entry, but that is a more difficult solution.<br />

Slice c0t1d0s0 is too small to contain<br />

1 replicas<br />

Cause<br />

When trying to add a state replica using metatool to cylinder 0 of a disk, the<br />

following error message appears:<br />

Alphabetical Message Listing 171


Your attempt to attach metastate database<br />

replicas on slice "c?t?d?s?" failed for the<br />

following reason: Slice c?t?d?s? is too small<br />

to contain 1 replicas.<br />

This is because metatool masks out the very first cylinder to protect the disk label.<br />

On disksuite v4.1, metatool does allow adding the databases to cylinder 0 on<br />

2.1gb disks or larger.<br />

Action<br />

The workaround is not to start at cylinder 0 but at cylinder 1, or use the command<br />

line (metadb -a).<br />

Socket type not supported<br />

Cause<br />

The support for the socket type has not been configured into the system or no<br />

implementation for it exists.<br />

Technical Notes<br />

The symbolic name for this error is ESOCKTNOSUPPORT, errno=121.<br />

Soft error rate (int%) during writing<br />

was too high<br />

Cause<br />

This message from the SCSI tape drive appears when Exabyte or DAT tapes generate<br />

too many soft (recoverable) errors. It is followed by the advisory "Please, replace tape<br />

cartridge" message. Soft errors are an indication that hard errors could soon occur,<br />

causing data corruption.<br />

172 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

First clean the tape head with a cleaning tape as recommended by the manufacturer.<br />

If that doesn’t work, replace the tape cartridge. You might need to replace the tape<br />

drive if the problem still occurs with new tape cartridges.<br />

Soft error rate (retries = int) during<br />

writing was too high<br />

Cause<br />

This message from the SCSI tape drive appears when Archive tapes generate too<br />

many soft (recoverable) errors. It is followed by the advisory "Periodic head cleaning<br />

required and/or replace tape cartridge" message. Soft errors are an indication that<br />

hard errors could soon occur, causing data corruption.<br />

Action<br />

First clean the tape head with a cleaning tape as recommended by the manufacturer.<br />

If that doesn’t work, replace the tape cartridge. You might need to replace the tape<br />

drive if the problem still occurs with new tape cartridges.<br />

Software caused connection abort<br />

Cause<br />

A connection abort was caused internal to your host machine.<br />

Technical Notes<br />

The symbolic name for this error is ECONNABORTED, errno=130.<br />

Alphabetical Message Listing 173


Srmount error<br />

Cause<br />

This error is RFS specific. It occurs when an attempt is made to stop RFS while<br />

resources are still mounted by remote machines, or when a resource is readvertised<br />

with a client list that does not include a remote machine with the resource currently<br />

mounted.<br />

Technical Notes<br />

The symbolic name for this error is ESRMNT, errno=69.<br />

Stale NFS file handle<br />

Cause<br />

A file or directory that was opened by an NFS client was either removed or replaced<br />

on the server.<br />

Action<br />

If you were editing this file, write it to a local filesystem instead. Try remounting the<br />

filesystem on top of itself or shutting down any client processes that refer to stale file<br />

handles. If neither of these solutions works, reboot the system.<br />

Technical Notes<br />

The original vnode is no longer valid. The only way to get rid of this error is to<br />

force the NFS server and client to renegotiate file handles.<br />

The symbolic name for this error is ESTALE, errno=151.<br />

174 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


statd: cannot talk to statd at <strong>string</strong><br />

Cause<br />

This message comes from the NFS status monitor daemon statd(1M), which<br />

provides crash recovery services for the NFS lock daemon lockd(1M). The message<br />

indicates that statd(1M) has left old references in the /var/statmon/sm and<br />

/var/statmon/sm.bak directories. After a user has removed or modified a host in<br />

the hosts database, statd(1M) might not properly purge files in these directories,<br />

which results in its trying to communicate with a nonexistent host.<br />

Action<br />

Remove the file named variable (where variable is the hostname) from both the<br />

/var/statmon/sm and /var/statmon/sm.bak directories. Then kill the statd<br />

daemon and restart it. If that doesn’t get rid of the message, kill and restart<br />

lockd(1M) as well. If that doesn’t work, reboot the machine at your convenience.<br />

stty: TCGETS: Operation not supported<br />

on socket<br />

Cause<br />

This message results when a user tries to remote copy with rcp(1) or remote shell<br />

with rsh(1) from one machine to another, but has an stty(1) command in the<br />

remote .cshrc file. This error results in failure of the rcp(1) or rsh(1) command.<br />

Action<br />

The solution is to move the invokation of the stty(1) command to the user’s<br />

.login (or equivalent) file. Alternatively, execute the stty(1) command in .cshrc<br />

only when the shell is interactive. Here is a test to do that:<br />

if ($?prompt) stty ...<br />

Alphabetical Message Listing 175


Technical Notes<br />

The rcp(1) and rsh(1) commands make a connection using sockets, which do not<br />

support stty(1)’s TCGETS ioctl.<br />

su: No shell<br />

Cause<br />

This message indicates that someone changed the default login shell for root to a<br />

program that is missing from the system. For example, the final colon-separated field<br />

in /etc/passwd could have been changed from /sbin/sh to /usr/bin/bash,<br />

which does not exist in that location. Possibly an extra space was appended at the<br />

end of the line. The outcome is that you cannot login as root or switch user to<br />

root, and so cannot directly fix this problem.<br />

Action<br />

The only solution is to reboot the system from another source, then edit the<br />

password file to correct this problem. Invoke sync(1M) several times, then halt the<br />

machine by typing Stop-A or by pressing the reset button. Reboot single-user from<br />

CD-ROM, the net, or diskette, such as by typing boot cdrom -s at the ok prompt.<br />

After the system comes up and gives you a # prompt, mount the device<br />

corresponding to the original / partition somewhere, such as with a mount(1M)<br />

command similar to the one below. Then run an editor on the newly-mounted<br />

system password file (use ed(1) if terminal support is lacking):<br />

# mount /dev/dsk/c0t3d0s0 /mnt<br />

# ed /mnt/etc/passwd<br />

Use the editor to change the password file’s root entry to call an existing shell, such<br />

as /usr/bin/csh or /usr/bin/ksh.<br />

Technical Notes<br />

To keep the "No shell" problem from happening, habitually use admintool or<br />

/usr/ucb/vipw to edit the password file. These tools make it difficult to change<br />

password entries in ways that make the system unusable.<br />

176 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


SunPC may NOT run correctly as root<br />

Cause<br />

With SunPC 4.1 and the 102924 jumbo patch installed: when a user attempts to run<br />

SunPC, the following error message is displayed:<br />

SunPC may NOT run correctly as root.<br />

Please run in user mode.<br />

SunPC script is exiting<br />

Yet, the user is not root.<br />

The user’s primary group id is probably root. For example:<br />

$ /usr/bin/id<br />

uid=33650(gruff) gid=0(root)<br />

Action<br />

Change the user’s primary group to another group, such as 10, and, the user still<br />

needs to be in teh root group, add the root group to the user’s secondary group list.<br />

su: ’su root’ failed for <strong>string</strong> on<br />

/dev/pts/int<br />

Cause<br />

The user specified after "for" tried to become superuser, but typed the wrong<br />

password.<br />

Action<br />

If the user is supposed to know the root password, wait to see if the correct<br />

password is supplied. If the user is not supposed to know the root password, ask<br />

why he or she is attempting to become superuser.<br />

Alphabetical Message Listing 177


su: ’su root’ succeeded for <strong>string</strong> on<br />

/dev/pts/int<br />

Cause<br />

The user specified after "for" just became superuser by typing the root password.<br />

Action<br />

If the user is supposed to know the root password, this message is only<br />

informational. If the user is not supposed to know the root password, change this<br />

password immediately and ask how the user learned it.<br />

syncing file systems...<br />

Cause<br />

This indicates that the kernel is updating the super-blocks before taking the system<br />

down, to ensure filesystem integrity. This message appears after a halt(1M) or<br />

reboot(1M) command. It can also appear after a system panic, in which case the<br />

system might contain corrupted data.<br />

Action<br />

If you just halted or rebooted the machine, don’t worry– this message is normal. In<br />

case of a system panic, look up the panic messages that appear above this one. Your<br />

system vendor might be able to help diagnose the problem. So that you can describe<br />

the panic to the vendor, either leave your system in its panicked state or be sure that<br />

you can reproduce the problem.<br />

Technical Notes<br />

Numbers that sometimes display after the three dots in the message show the count<br />

of dirty pages that are being written out. Numbers in brackets show an estimate of<br />

the number of busy buffers in the system.<br />

178 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


SYSLOGD CAUSES SYSTEM HANGS<br />

Cause<br />

(Over and Over again = installpatch problems)<br />

syslog service starting.<br />

Cause<br />

During system reboot, this message might appear and the boot seems to hang. After<br />

starting syslogd(1M) service, the system runs /etc/rc2.d/S75cron, which in<br />

turn calls ps(1). Sometimes after an abrupt system crash /dev/bd.off becomes a<br />

link to nowhere, causing the ps(1) command to hang indefinitely.<br />

Action<br />

Reboot single user (for example with boot -s) and run ls -l /dev/bd* to see if<br />

this is the problem. If so, remove /dev/bd.off, then run bdconfig off or reboot<br />

with the -r (reconfigure) option.<br />

This is the most commonly reported situation that causes ps(1) to hang.<br />

System booting after fatal error FATAL<br />

Cause<br />

The system reboots automatically. Afterward, the messages file contains<br />

System booting after fatal error FATAL.<br />

The message is issued during a reboot after the system detects a hardware error.<br />

Things which can cause this are: UPA address parity error, Master queue overflows,<br />

DTAG parity errors, E-Cache tag parity errors, and Coherence errors.<br />

Action<br />

Use prtdiag(1M) to help identify failed hardware components. The errors indicate<br />

either have a bad CPU module or a bad system board.<br />

Alphabetical Message Listing 179


system hang<br />

Cause<br />

4.1.3C SBUS cards suffer system freeze<br />

SYSTEM HANGS DURING BOOT<br />

Cause<br />

When the user boots a system, it hangs after the boot up messages "root on," "swap<br />

on," and "dump on." After the system displays these messages, the LEDs will flash<br />

and the system hangs.<br />

This is the result of an earlier fsck that deleted devices under the /dev directory.<br />

Check for the /dev/console device and if it is missing, make one.<br />

"T"<br />

tar: /dev/rmt/0: No such file or<br />

directory<br />

Cause<br />

The default tape device /dev/rmt/0, or possibly the device specified by the TAPE<br />

environment variable, is not currently connected to the system, is not configured, or<br />

its hardware symbolic link is broken.<br />

Action<br />

List the files in the /dev/rmt directory to see which tape devices are currently<br />

configured. If none are configured, ensure that a tape device is correctly attached to<br />

the system, and reboot with the -r option to reconfigure devices.<br />

If tape devices other than /dev/rmt/0 are configured, you could specify one of<br />

them after the -f option of tar(1).<br />

180 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


tar: directory checksum error<br />

Cause<br />

This error message from tar(1) indicates that the checksum of the directory and the<br />

files it has read from tape does not match the checksum advertised in the header<br />

block. Usually this indicates the wrong blocking factor, although it could indicate<br />

corrupt data on tape.<br />

Action<br />

To resolve this problem, make certain that the blocking factor you specify on the<br />

command line (after -b) matches the blocking factor originally specified. If in doubt,<br />

leave out the block size and let tar(1) determine it automatically. If that doesn’t<br />

help, tape data could be corrupted.<br />

tar: tape write error<br />

Cause<br />

A physical write error has occurred on the tar(1) output file, which is usually a<br />

tape, although it could be a diskette or disk file. Look on the system console, where<br />

the device driver should provide the actual error condition. This might be a<br />

write-protected tape, a physical I/O error, an end-of-tape condition, or a file too large<br />

limitation.<br />

Action<br />

In the case of write-protected tapes, enable the write switch. For physical I/O errors,<br />

the best course of action is to replace the tape with a new one. For end-of-tape<br />

conditions, try using a higher density if the device supports one, or use cpio(1) or<br />

pax(1) for their multi-volume support. When encountering file too large limitations,<br />

use the parent shell’s limit(1) or ulimit(1) facility to increase the maximum file<br />

size.<br />

See Also<br />

For more information on tar tapes, see the section on copying UFS files in the System<br />

Administration Guide, Volume I.<br />

Alphabetical Message Listing 181


Text file busy<br />

Cause<br />

This can occur when an attempt was made to execute a pure-procedure program that<br />

is currently open for writing. It also occurs because of attempts to open for writing<br />

or to remove a pure-procedure program that is being executed. (This message is<br />

obsolete.)<br />

Technical Notes<br />

The symbolic name for this error is ETXTBSY, errno=26.<br />

Text is lost because the maximum edit<br />

log size has been exceeded.<br />

Cause<br />

This message appears at the beginning of a cmdtool(1) session after 100,000 characters<br />

have gone by in the scrolling window. Clicking on the top rectangle of the scrollbar<br />

might display this message. No data were lost, but the user cannot scroll back before<br />

this wraparound point.<br />

Action<br />

To increase the maximum size of the Command Tool log file, use cmdtool -M,<br />

specifying more than 100,000 bytes.<br />

THE FOLLOWING FILE SYSTEM(S) HAD AN<br />

UNEXPECTED INCONSISTENCY:<br />

Cause<br />

At boot time the /etc/rcS script runs the fsck(1M) command to check the<br />

integrity of filesystems marked "fsck" in /etc/vfstab. If fsck(1M) cannot repair a<br />

filesystem automatically, it interrupts the boot procedure and produces this message.<br />

When fsck(1M) gets into this state, it cannot repair filesystems without losing one<br />

182 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


or more files, so it wants to defer this responsibility to you, the administrator. Data<br />

corruption has probably already occurred.<br />

Action<br />

First run fsck -n on the filesystem, to see how many and what type of problems<br />

exist. Then run fsck(1M) again to repair the filesystem. If you have a backup of the<br />

filesystem, you can generally answer "y" to all the fsck(1M) questions. It’s a good<br />

idea to keep a record of all problematic files and inode numbers for later reference.<br />

To run fsck(1M) yourself, specify options as recommended by the boot script. For<br />

example:<br />

# fsck /dev/rdsk/c0t4d0s0<br />

Usually, files lost during fsck(1M) repair were created just before a crash or power<br />

outage, and cannot be recovered. If important files are lost, you can recover them<br />

from backup tapes.<br />

If you don’t have a backup, ask an expert to run fsck(1M) for you.<br />

See Also<br />

For more information, see the section on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

The SCSI bus is hung. Perhaps an<br />

external device is turned off.<br />

Cause<br />

This message appears near the beginning of rebooting, immediately after a "Boot<br />

device: ..." message, then the system hangs. The problem is conflicting SCSI targets<br />

for a non-boot device. Having an external device turned off is unlikely to cause this<br />

problem.<br />

Action<br />

See the message "Boot device: /iommu/sbus/<strong>string</strong>/<strong>string</strong>/sd@3,0" for a solution.<br />

Alphabetical Message Listing 183


See Also<br />

For more information, see the section on halting and booting in the System<br />

Administration Guide, Volume I.<br />

THE SYSTEM IS BEING SHUT DOWN NOW !!!<br />

Cause<br />

This message means the system is going down immediately and it’s too late to save<br />

any changes.<br />

Action<br />

This message is often preceded by messages telling you that the system is going<br />

down in 15 minutes, 10 minutes, and so on. When you see these initial broadcast<br />

shutdown messages, save all your work, send any email you’re working on, and<br />

close your files. Fortunately vi(1) sessions are automatically saved for later recovery,<br />

but many other applications have no crash protection mechanism. Data loss is likely.<br />

See Also<br />

For more information on shutting down the system, see the System Administration<br />

Guide, Volume I. If you are using the AnswerBook, "halting the system" is a good<br />

search <strong>string</strong>.<br />

The system will be shut down in int<br />

minutes<br />

Cause<br />

This message from the system shutdown(1M) script informs you that the superuser<br />

is taking down the system.<br />

Action<br />

Save all changes now or your work will be lost. Write out any files you were<br />

changing, send any email messages you were composing, and close your files.<br />

184 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


See Also<br />

For more information on shutting down the system, see the System Administration<br />

Guide, Volume I. If you are using the AnswerBook, "halting the system" is a good<br />

search <strong>string</strong>.<br />

This gateway does not support Unix<br />

Password.<br />

Cause<br />

While using Firewall v2.0, the following sequence happpens:<br />

# telnet firewall-machine<br />

Trying 192.29.174.60 ...<br />

Connected to firewall-machine<br />

Escape character is ’^]’.<br />

CheckPoint FireWall-1 authenticated Telnet server running on<br />

firewall-machine<br />

Login: testuser<br />

This gateway does not support Unix Password.<br />

Action<br />

Under Network Objects, edit your Gateway object, Host Properties Auth Schemes<br />

and select Unix Password. UNIX Password is not checked by default as it is<br />

considered an unsecure method of authentication.<br />

This mail file has been changed by<br />

another mail reader.<br />

Cause<br />

This message appears in a pop-up dialog box whenever you start mailtool(1) while<br />

another mail reader has the inbox locked. A question follows: "Do you wish to ask<br />

that mail reader to save the changes?" You are given three choices.<br />

Alphabetical Message Listing 185


Action<br />

If you choose "Save Changes," mailtool(1) requests the other mail reader to relinquish<br />

its lock and write out any changes it has made to your inbox. If you choose "Ignore,"<br />

mailtool(1) reads your inbox without locking it. If you choose "Cancel" mailtool(1) exits.<br />

Timeout waiting for ARP/RARP packet<br />

Cause<br />

This problem can occur while booting from the net, and indicates a network<br />

connection problem.<br />

Action<br />

Make sure the Ethernet cable is connected to the network. Check that this system has<br />

an entry in the NIS ethers(4) map or locally on the boot server. Then check the IP<br />

address of the server and the client to make sure they are on the same subnet. Local<br />

/etc/hosts files must agree with one another and with the NIS hosts(4) map.<br />

If those conditions are not causing the problem, go to the system’s PROM monitor<br />

ok prompt and run test net to test the network connection. (On older PROM<br />

monitors, use test-net instead.) If the network test fails, check the Ethernet port,<br />

card, fuse, and cable, replacing them if necessary. Also check the twisted pair port to<br />

make sure it is patched to the correct subnet.<br />

See Also<br />

For more information on packets, see SPARC: Installing Solaris Software. If you are<br />

using the AnswerBook, "ARP/RARP" is a good search <strong>string</strong>.<br />

Timer expired<br />

Cause<br />

The timer set for a STREAMS ioctl call has expired. The cause of this error is<br />

device specific and could indicate either a hardware or software failure, or perhaps a<br />

timeout value that is too short for the specific operation. The status of the ioctl(2)<br />

operation is indeterminate. This is also returned in the case of<br />

_lwp_cond_timedwait(2) or cond_timedwait(3T).<br />

186 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

The symbolic name for this error is ETIME, errno=62.<br />

token ring hangs<br />

Cause<br />

4.1.3C SBUS cards suffer system freeze<br />

Too many links<br />

Cause<br />

An attempt was made to create more than the maximum number of hard links<br />

(LINK_MAX, by default 32767) to a file. Because each subdirectory is a link to its<br />

parent directory, the same error results from trying to create too many subdirectories.<br />

Action<br />

Check to see why the file has so many links to it. To get more than the maximum<br />

number of hard links, use symbolic links instead.<br />

Technical Notes<br />

The symbolic name for this error is EMLINK, errno=31.<br />

Too many open files<br />

Cause<br />

A process has too many files open at once. The system imposes a per-process soft<br />

limit on open files, OPEN_MAX (usually 64), which can be increased, and a<br />

per-process hard limit (usually 1024), which cannot be increased.<br />

Alphabetical Message Listing 187


Action<br />

You can control the soft limit from the shell. In the C shell, use the limit(1)<br />

command to increase the number of descriptors. In the Bourne or Korn shells,<br />

use the ulimit -n command to increase the number of file descriptors.<br />

If the window system refuses to start new applications because of this error, increase<br />

the open file limit in your login shell before starting the window system.<br />

Technical Notes<br />

The symbolic name for this error is EMFILE, errno=24.<br />

Transport endpoint is already<br />

connected<br />

Cause<br />

A connect request was made on an already connected transport endpoint; or, a<br />

sendto(3XN) or sendmsg(3XN) transport endpoint specified a destination when<br />

already connected.<br />

Technical Notes<br />

The symbolic name for this error is EISCONN, errno=133.<br />

Transport endpoint is not connected<br />

Cause<br />

A request to send or receive data was disallowed because the transport endpoint is<br />

not connected and (when sending a datagram) no address was supplied.<br />

Technical Notes<br />

The symbolic name for this error is ENOTCONN, errno=134.<br />

188 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


TRAP 3E<br />

Cause<br />

Ultra system fails to boot with TRAP 3E. The system sometimes also displays bad<br />

magic number errors.<br />

This is caused by a bad superblock on the boot disk. Which, in turn, could have beee<br />

caused by a SCSI configuration problem.<br />

Action<br />

To fix:<br />

1. Check SCSI bus for illegal configuration, bad cables, and duplicate SCSI addresses;<br />

2. Boot from cdrom in single user.<br />

OK boot cdrom -sw<br />

3. Attempt to fsck(1M) boot disk. This will probably fail with a superblock error.<br />

# fsck /dev/rdsk/device<br />

4. Find out locations of alternate superblocks. BE SURE TO USE AN UPPERCASE<br />

-N. For example:<br />

# newfs -N /dev/rdsk/c0t0d0s0<br />

/dev/rdsk/c0t0d0s0: 2048960 sectors in 1348 cylinders of 19 tracks,<br />

80 sectors 1000.5MB in 85 cyl groups (16 c/g, 11.88MB/g, 5696 i/g)<br />

super-block backups (for fsck -F ufs -o b=#) at:<br />

32, 24432, 48832, 73232, 97632, 122032, 146432, 170832, 195232, 219632,<br />

244032, 268432, 292832, 317232, 341632, 366032, 390432, 414832, 439232,<br />

463632, 488032, 512432, 536832, 561232, 585632, 610032, 634432, 658832,<br />

683232, 707632, 732032, 756432, 778272, 802672, 827072, 851472, 875872,<br />

900272, 924672, 949072, 973472, 997872, 1022272, 1290672, ...<br />

5. Using an alternate superblock, fsck(1M) the disk. You may have to try more than<br />

one alternate superblock to get this to work. Pick a couple from the beginning, the<br />

middle, and the end.<br />

# fsck -o b= /dev/rdsk/c0t0d0s0<br />

Alphabetical Message Listing 189


6. The boot block is probably bad too. Restore it while we are booted from the cdrom.<br />

# /usr/sbin/installboot /usr/platform/architecture/lib/fs/ufs/bootblk /dev/rdsk/c0t0d0s0<br />

7. Reboot the O.S. Should come up now.<br />

# reboot<br />

"U"<br />

ufsdump 4mm commands<br />

Cause<br />

Dump syntax used with autoloader<br />

umount: warning: /<strong>string</strong> not in mnttab<br />

Cause<br />

This message results from the superuser attempting to unmount a filesystem that is<br />

not mounted. Subdirectories of filesystems, such as /var, cannot be unmounted.<br />

Action<br />

Run the mount(1M) or df(1M) command to see what filesystems are mounted. If<br />

you really want to unmount one of them, specify the existing mount point.<br />

190 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Unable to connect to license server.<br />

Inconsistent encryption code.<br />

Cause<br />

User receives the following error message. The only thing that had changed was the<br />

IP address of the machine.<br />

Action<br />

The IP address defined with ifconfig(1M) must match that in /etc/hosts. That<br />

is, if you change the machine’s IP address with ifconfig(1M) you must also<br />

change the machine’s entry in the /etc/hosts file.<br />

For machines with multiple interfaces, you must check and possibly update<br />

/etc/hostname.*.<br />

unable to get pty!<br />

Cause<br />

When trying to bring up a Terminal window (dtterm) in CDE, a popup appears<br />

stating: Unable to get pty!.<br />

Action<br />

This error is because dtterm is not able to open /dev/pts/int (where int is an<br />

integer). The reason they cannot open this file is grantpt(3C) failed to change the<br />

permissions on the file. grantpt(3C) failed because the binary<br />

/usr/lib/pt_chmod is not setuid root. The permissions on /usr/lib/pt_chmod<br />

must be 4111.<br />

To restore the correct permissions to pt_chmod, use the following command: (as<br />

root)<br />

# chmod 4111 /usr/lib/pt_chmod<br />

Alphabetical Message Listing 191


Unable to install/attach driver ’<strong>string</strong>’<br />

Cause<br />

These messages appear in /var/adm/messages at boot time, when the system tries<br />

to load drivers for devices the machine does not have.<br />

Action<br />

Despite the alarmist tone, this message is intended as only informational. You<br />

probably don’t want all these device drivers, because they make your system kernel<br />

larger, requiring more memory.<br />

Unable to open nwrecover, Error:<br />

nwrecover: NSR: please start a server<br />

on client_name<br />

Cause<br />

While trying to open the graphical recovery interface by running nwrecover from<br />

the client, the error was displayed.<br />

Action<br />

In this case, multiple networker servers existed and nwrecover could not determine<br />

which network server to use for the client.<br />

The server can be specified to the nwrecover command with the -s option.<br />

nwrecover -c client_name -s server_name<br />

Where -s server_name sets the NetWorker server and -c<br />

NetWorker client index.<br />

client_name sets the<br />

192 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


uname: error writing name when booting<br />

Cause<br />

Error is as follows: uname: error writing name when booting System cannot<br />

bootstrap.<br />

Action<br />

Boot off CD-ROM and check /etc/nodename. The file must contain exactly one line<br />

with the name of the system. No blank, or other, lines are allowed.<br />

undefined control<br />

Cause<br />

This message, prefaced by the file name and line number involved, is from the C<br />

preprocessor /usr/ccs/lib/cpp, and indicates a line starting with a sharp (#) but<br />

not followed by a valid keyword such as define or include.<br />

Action<br />

A piece of software might be running the C preprocessor on an initialization file that<br />

you thought was interpreted by a shell. In most shells, the sharp (#) indicates a<br />

comment. The C preprocessor considers comments to be anything between /* and */<br />

delimiters.<br />

Unmatched ‘<br />

Cause<br />

This message from the C shell csh(1) indicates that a user typed a command<br />

containing a backquote symbol (‘) without a closing backquote. Similar messages<br />

result from an unmatched single quote (’) or an unmatched double quote ("). Other<br />

shells generally give a continuation prompt when a command line contains an<br />

unmatched quote symbol.<br />

Alphabetical Message Listing 193


Action<br />

Correct the command line and try again. To continue typing on another line, give the<br />

C shell a backslash right before the newline.<br />

UNREF FILE I=i OWNER=o MODE=m SIZE=s<br />

MTIME=t CLEAR?<br />

Cause<br />

During phase 4, fsck(1M) discovered that the specified file was orphaned because<br />

the inode had no record of its pathname. In other words, the file was not connected<br />

with any directory.<br />

Action<br />

Answer yes to reconnect the file into the lost+found directory. Then contact the<br />

file’s owner to ask if you should send it back, and where to place it.<br />

See Also<br />

For more information, see the chapter on checking filesystem integrity in the System<br />

Administration Guide, Volume I.<br />

Use "logout" to logout.<br />

Cause<br />

This C shell message might come as a surprise to Bourne or Korn shell users<br />

accustomed to logging out with a Control-d.<br />

Action<br />

When ignoreeof is set, the C shell requires users to logout by typing logout(1)<br />

or exit(1). Write any modified files to disk before exiting.<br />

194 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


user unknown<br />

Cause<br />

When trying to mail to a user, the error Username... User unknown is displayed.<br />

The user is on the same system.<br />

Action<br />

Check for typo the entered e-mail address. Or, the user could be aliased to a<br />

non-existant e-mail address in /etc/mail/aliases or the user’s .mailrc file.<br />

You cannot mail to a user that has capital letters in its name. sendmail(1M)<br />

converts all the capital letters to lowercase before attempting to find the user. Since<br />

UNIX is case sensitive, it finds no username on the system with all lowercase letters,<br />

so it displays the User unknown message.<br />

Workaround: Make sure all usernames are composed of *only* lowercase letters.<br />

/usr/dt/bin/rpc.ttdbserverd:Child<br />

Status’ changed<br />

Cause<br />

While running CDE, the error in the console or /var/adm/messages file:<br />

Oct 19 04:41:00 darkcastle last message repeated 393 times<br />

Oct 19 04:41:01 darkcastle inetd[120]: /usr/dt/bin/rpc.ttdbserverd:Child Status Changed<br />

Action<br />

The fix is to create the following soft links:<br />

ln -s /usr/openwin/bin/rpc.ttdbserver /usr/dt/bin/rpc.ttdbserver<br />

ln -s /usr/openwin/bin/rpc.ttdbserverd /usr/dt/bin/rpc.ttdbserverd<br />

Alphabetical Message Listing 195


usr/openwin/bin/xinit: connection to<br />

X server lost<br />

Cause<br />

This means that the xinit(1) program, which sets up X11 resources and starts a<br />

window manager, failed to locate the X server process. Perhaps the user interrupted<br />

window system startup, or exited abnormally from OpenWindows (for example, by<br />

killing processes or by rebooting). It is possible that the X server crashed. Data loss is<br />

possible in some cases. Depending on process timing, this message might be normal<br />

when OpenWindows exits during a system reboot.<br />

Action<br />

The only solution is to exit and restart OpenWindows. You do not need to reboot the<br />

system unless it hangs and fails to give you a console prompt. To exit<br />

OpenWindows, select Workspace->Exit. To restart OpenWindows, type openwin(1) at<br />

the system prompt.<br />

/usr/ucb/cc: language optional software<br />

package not installed<br />

Cause<br />

When compiling some code for BSD compatibility the error happened after invoking<br />

usr/ucb/cc. The unbundled compiler (SPARCworks Professional C product) was<br />

installed in /opt.<br />

/usr/ucb/cc is a script which checks for the file /usr/ccs/bin/ucbcc and, if<br />

found, invokes it with appropriate library flags for BSD-compatibility compilation.<br />

/usr/ucb/cc is part of the package SUNWscpu. /usr/ccs/bin/ucbcc is<br />

supposed to be a symbolic link to /opt/SUNWspro/bin/acc, which is created<br />

during installation of the unbundled C compiler, SPROcc.<br />

Action<br />

Verify that you have the essential OS-bundled Developer packages, SUNWscpu,<br />

SUNWbtool, and the unbundled C compiler:- SPROcc. However, /usr/ccs/bin/<br />

196 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


ucbcc was missing on customer’s system. Evidently somewhere along the line this<br />

link was removed.<br />

Solve the problem by creating a new symbolic link:<br />

# ln -s /opt/SUNWspro/bin/acc /usr/ccs/bin/ucbcc<br />

Invoke usr/ucb/cc to verify this worked.<br />

Commands used to identify which packages contain particular components involved:<br />

craterlake% grep ucb/cc /var/sadm/install/contents<br />

/usr/ucb/cc f none 0555 bin bin 3084 50323 814621113 *SUNWscpu<br />

craterlake% ls -l /usr/ucb/cc<br />

-r-xr-xr-x 1 bin bin 3084 Oct 25 1995 /usr/ucb/cc<br />

craterlake% file !$<br />

file /usr/ucb/cc<br />

/usr/ucb/cc: executable /usr/bin/sh script<br />

craterlake% grep ucbcc /var/sadm/install/contents<br />

/usr/ccs/bin/ucbcc=/opt1/40/SUNWspro/SC4.0/bin/acc s none SPROcc SPROcc.2 SPROcc.5<br />

craterlake% file /usr/ccs/bin/ucbcc<br />

/usr/ccs/bin/<br />

ucbcc: ELF 32-bit MSB executable SPARC Version 1, dynamically linked, stripped<br />

craterlake% ls -l /usr/ccs/bin/ucbcc<br />

lrwxrwxrwx 1 root other 31 Aug 23 1996 /usr/ccs/bin/ucbcc -> /opt1/40/SUNWspro/SC4.0/bin/<br />

acc<br />

UX: userdel: error: Cannot update<br />

system files login cannot be deleted<br />

Cause<br />

When using userdel to delete a user:<br />

userdel -r userid<br />

If the root (/) filesystem is full, the error is displayed.<br />

Action<br />

Free up space on root (/) filesystem.<br />

Alphabetical Message Listing 197


"V"<br />

Value too large for defined data type<br />

Cause<br />

The user ID or group ID of an IPC object or file system object was too large to be<br />

stored in an appropriate member of the caller-provided structure.<br />

Action<br />

Run the application on a newer system, or ask the program’s author to fix this<br />

condition.<br />

Technical Notes<br />

This error occurs only on systems that support a larger range of user or group ID<br />

values than a declared member structure can support. This condition usually occurs<br />

because the IPC or file system object resides on a remote machine with a larger value<br />

of type uid_t, off_t, orgid_t than that of the local system.<br />

The symbolic name for this error is EOVERFLOW, errno=79.<br />

Volume Manager reports error:<br />

Cause<br />

After upgrading from VxVM 2.0 or 2.1 to 2.3, when attempting to run vxva, the<br />

volume manager GUI, you get the message:<br />

Volume Manager reports error:<br />

Configuration daemon can’t speak protocol version<br />

This message indicates that there is a version mismatch between the version of the<br />

volume manager daemon (vxconfigd) and the GUI (vxva) you are trying to run.<br />

198 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


For example, you are running the 2.3 version of vxconfigd, and trying to run an<br />

old (2.1) version of vxva.<br />

The most likely reason for this to happen is because the you are using the wrong<br />

path for vxva. For vxva versions 2.1 and below, the binary was found in<br />

/opt/vxva/bin, but starting with 2.1.1, the location was changed to<br />

/opt/SUNWvxva/bin.<br />

If you did not remove the old SUNWvxva package before installing the new 2.3<br />

version (which is a normal thing; you don’t NEED to remove the old package), you<br />

probably still have the old /opt/vxva/bin in your $PATH, and thus, you’re<br />

attempting to run the older version of vxva.<br />

Action<br />

Run the newer vxva program: /opt/SUNWvxva/bin/vxva. If that does work and<br />

you do not get the error message, simply remove /opt/vxva/bin/vxva from your<br />

path statement or remove the old version of vxva and create a symbolic link to the<br />

new version with the following two commands:<br />

# rm /opt/vxva/bin/vxva<br />

# ln -s /opt/SUNWvxva/bin/vxva /opt/vxva/bin/vxva<br />

vxconfigd error: segmentation fault<br />

Cause<br />

When the system boots, the vxconfid fails to start. It fails with a segmentation fault<br />

(core dump).<br />

vxconfigd error: segmentation fault<br />

[ vxvm warning: _illegal vminor encountered ]<br />

Alphabetical Message Listing 199


Action<br />

Check the date on the system using date(1) (/bin/date or /usr/bin/date). If<br />

the date on the system is very old (like 1970) or very far out in the future (like 2010),<br />

vxconfigd core dumps.<br />

Change the date on the system using /bin/date or /usr/bin/date and the<br />

vxconfigd will start like a champ.<br />

vxvm:vxslicer:ERROR unsupported disk<br />

layout<br />

Cause<br />

When trying to encapsulate a disk you get this error.<br />

Action<br />

You must meet the minimum requirements to encapsulate a disk:<br />

1. You must have two free, zero length, slices on the disk (no cylinders should be<br />

assigned to these slices.)<br />

2. You must have two free cylinders on the disk. These 2 cylinders must not be in<br />

use by any slice other than slice two.<br />

3. The two free cylinders must be located at the beginning or end of the drive.<br />

"W"<br />

WARNING: add_spec: No major number<br />

for sf<br />

Cause<br />

The system prints the following warning message while booting:<br />

SunOS Release 5.5.1 Version Generic_103640-03 [UNIX(R)<br />

System V Release 4.0]<br />

200 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


(Continuation)<br />

Copyright (c) 1983-1996, Sun Microsystems, Inc.<br />

WARNING: add_spec: No major number for sf<br />

The sf(7D) driver is specific for a Sun Enterprise Network Array (SENA), also<br />

known as a "photon".<br />

Action<br />

If there is no SENA attached to the system, the message can be safely ignored. To<br />

stop seeing the message comment out the last line in /kernel/drv/ssd.conf that<br />

references sf(7D).<br />

If you do this, and then later attach a SENA to your system, please remember to<br />

uncomment this line again.<br />

WARNING: Clock gained int days-- CHECK<br />

AND RESET THE DATE!<br />

Cause<br />

Each workstation contains an internal clock powered by a rechargeable battery. After<br />

the system is halted and turned off, the internal clock continues to keep time. When<br />

the system is powered on and reboots, the system notices that the internal clock has<br />

gained time since the workstation was halted.<br />

Action<br />

In most cases, especially if the power has been off for less than a month, the internal<br />

clock keeps the correct time, and you do not have to reset the date. Use the date(1)<br />

command to check the date and time on your system. If the date or time is wrong,<br />

become superuser and use the date(1) command to reset them.<br />

Alphabetical Message Listing 201


WARNING: No network locking on <strong>string</strong>:<br />

contact admin to install server change<br />

Cause<br />

The Solaris 2 mount(1M) command issues this message whenever it mounts a<br />

filesystem that doesn’t have NFS locking, such as a standard SunOS 4.1 exported<br />

filesytem. Data loss is possible in applications that depend on locking.<br />

Action<br />

On the remote SunOS 4.1 system, install the appropriate rpc.lockd jumbo patch to<br />

implement NFS locking. For SunOS 4.1.4, install patch #102264; for SunOS 4.1.3,<br />

install patch #100075; for earlier 4.1 releases, install patch #101817.<br />

WARNING: processor level 4 interrupt<br />

not serviced<br />

Cause<br />

This message is basically a diagnostic from the SCSI driver. Especially on machines<br />

with the sun4c architecture, it can appear on the console every 10 minutes or so.<br />

Action<br />

To reduce the frequency of this message, add this line near the bottom of the<br />

/etc/system file and reboot:<br />

set esp:esp_use_poll_loop=0<br />

Technical Notes<br />

You might also see this message repeatedly after manually removing a CD when it<br />

was busy. Don’t do this! To get the system back to normal, reboot the system with<br />

the -r (reconfigure) option.<br />

202 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


WARNING: /tmp: File system full, swap<br />

space limit exceeded<br />

Cause<br />

The system swap area (virtual memory) has filled up. You need to reduce swap<br />

space consumption by killing some processes or possibly by rebooting the system.<br />

Action<br />

See the message "Not enough space" for information about increasing swap space.<br />

WARNING: TOD clock not initialized--<br />

CHECK AND RESET THE DATE!<br />

Cause<br />

This message indicates that the Time Of Day (TOD) clock reads zero, so its time is<br />

the beginning of the UNIX epoch: midnight 31 December 1969. On a brand-new<br />

system, the manufacturer might have neglected to initialize the system clock. On<br />

older systems it is more likely that the rechargeable battery has run out and requires<br />

replacement.<br />

Action<br />

First replace the battery according to the manufacturer’s instructions. Then become<br />

superuser and use the date(1) command to set the time and date. On SPARC<br />

systems the clock is powered by the same battery as the NVRAM, so a dead battery<br />

also causes loss of the machine’s Ethernet address and host ID, which are more<br />

serious problems for networked systems.<br />

Alphabetical Message Listing 203


WARNING: Unable to repair the /<br />

filesystem. Run fsck<br />

Cause<br />

This message comes at boot time from the /etc/rcS script whenever it gets a bad<br />

return code from fsck(1M) after checking a filesystem. The message recommends<br />

an fsck(1M) command line, and instructs you to exit the shell when done to<br />

continue booting. Then the script places the system in single-user mode so<br />

fsck(1M) can be run effectively.<br />

Action<br />

See "/dev/rdsk/variable: UNEXPECTED INCONSISTENCY" for information about<br />

repairing UFS filesystems.<br />

See "THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED<br />

INCONSISTENCY" for information about repairing non-UFS filesystems.<br />

Watchdog Reset<br />

Cause<br />

This fatal error usually indicates some kind of hardware problem. Data corruption on<br />

the system is possible.<br />

Action<br />

Look for some other message that might help diagnose the problem. By itself, a<br />

watchdog reset doesn’t provide enough information; because traps are disabled, all<br />

information has been lost. If all that appears on the console is an ok prompt, issue<br />

the PROM command below to view the final messages that occurred just before<br />

system failure:<br />

ok f8002010 wector p<br />

Yes, that word is wector, not vector.<br />

The result is a display of messages similar to those produced by the dmesg(1M)<br />

command. These messages can be useful in finding the cause of system failure.<br />

204 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Technical Notes<br />

This message doesn’t come from the kernel, but from the OpenBoot PROM monitor,<br />

a piece of Forth software that gives you the ok prompt before you boot UNIX. If the<br />

CPU detects a trap when traps are disabled (an unrecoverable error), it signals a<br />

watchdog. The OpenBoot PROM monitor detects the watchdog, issues this message,<br />

and brings down the system.<br />

Who are you?<br />

Cause<br />

Many networking programs can print this message, including from(1B), lpr(1B),<br />

lprm(1B), mailx(1), rdist(1), sendmail(1M), talk(1), and rsh(1). The<br />

command prints this message when it cannot locate a password file entry for the<br />

current user. This might occur if a user logged in just before the superuser deleted<br />

that user’s password entry, or if the network naming service fails for a user who has<br />

no entry in the local password file.<br />

Action<br />

If a user’s password file entry was accidentally deleted, restore it from backups or<br />

from another password file. If a user’s login name or user ID was changed, ask that<br />

user to logout and login again. If the network naming service failed, check the NIS<br />

server(s) and repair or reboot as necessary.<br />

Technical Notes<br />

There is a known problem (bug 1138025) with starting hundreds of rsh(1) processes<br />

on another machine. This message appears because rsh(1) hangs while binding to a<br />

reserved port, and responds too slowly to interact with the network naming service.<br />

Window Underflow<br />

Cause<br />

This message often occurs at boot time, sometimes along with a "Watchdog Reset"<br />

error. It comes from the OpenBoot PROM monitor, which was passed a processor<br />

trap from the hardware. This error indicates that some program tried to access a<br />

SPARC register window that wasn’t accessible from the processor.<br />

Alphabetical Message Listing 205


Action<br />

On some system architectures, specifically sun4c, the problem could be that different<br />

capacity memory chips are mixed together. Someone might have placed 1MB SIMMs<br />

in the same bank with 4MB SIMMs. If this is so, rearrange the memory chips. Make<br />

sure to put higher-capacity SIMMs in the first bank(s), and lower-capacity SIMMs in<br />

the remaining bank(s); never mix different capacity SIMMs in the same bank.<br />

The problem could also be that cache memory on the motherboard has gone bad and<br />

needs replacement. If main memory is installed correctly, try swapping the<br />

motherboard.<br />

Technical Notes<br />

The best way to isolate the problem is to look at the %pc register to see where it got<br />

its arguments, and why the arguments were bad. If you can reproduce the condition<br />

causing this message, your system vendor might be able to help diagnose the<br />

problem.<br />

"X"<br />

X connection to <strong>string</strong>:0.0 broken<br />

(explicit kill or server shutdown).<br />

Cause<br />

This means that the client has lost its connection to the X server. The "0.0" represents<br />

the display device, which is usually the console. This message can appear when a<br />

user is running an X application on a remote system with the DISPLAY set back to<br />

the original system and the remote system’s X server disappears, perhaps because<br />

someone exited X windows or rebooted the machine. It sometimes appears locally<br />

when a user exits the window system. Data loss is possible if applications were<br />

killed before saving files.<br />

Action<br />

Try to run the application again in a few minutes after the system has rebooted and<br />

the window system is running.<br />

206 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


xinit: not found<br />

Cause<br />

OpenWindows was probably not installed properly, and the openwin(1) program<br />

could not find xinit(1) to start the X windows system. If the user is running another<br />

version of X windows, such as the MIT X11 distribution, the startx program serves<br />

the same function as xinit(1).<br />

Action<br />

Check the PATH environment variable to make sure it contains the appropriate X<br />

windows install directory. Verify that xinit(1) is in this directory as an executable<br />

program.<br />

XIO: fatal IO error 32 (Broken pipe)<br />

on X server "<strong>string</strong>:0.0"<br />

Cause<br />

This means that I/O with the X server has been broken. The "0.0" represents the<br />

display device, which is usually the console. This message can appear when a user is<br />

running Display PostScript applications and the X server disappears or the client is<br />

shut down. Data loss is possible if applications disappeared before saving files.<br />

Action<br />

Try to run the application again in a few minutes after the system has rebooted and<br />

the window system is running.<br />

Xlib: connection to "<strong>string</strong>:0.0" refused<br />

by server<br />

Cause<br />

This message is immediately followed by the "Xlib: Client is not authorized to<br />

connect to Server" message. These messages indicate that an X windows application<br />

Alphabetical Message Listing 207


tried to run on the X server specified inside double quotes, which did not allow the<br />

request. The "0.0" represents the display device, which is usually the console. If no<br />

server name appears, the superuser probably tried to run an X application on the<br />

current machine in an X session that was owned by somebody else.<br />

Action<br />

To allow this client to connect to the X server, run xhost(1) +clientname on the X<br />

server system. Only the owner of the current X session (who is not necessarily the<br />

superuser) is allowed to run the xhost(1) command. If somebody else is running X<br />

windows on the server, ask them to log out and then start your own X session on<br />

that server; remote X connections are usually allowed for the same user ID.<br />

Xlib: extension "GLX" missing on<br />

display "0.0"<br />

Cause<br />

Install OpenGL 1.0 and test the configuration by running<br />

/usr/openwin/demo/GL/ogl_install_check which results in following:<br />

# ./ogl_install_check<br />

Xlib: extension "GLX" missing on display "0.0".<br />

Xlib: extension "GLX" missing on display "0.0".<br />

Xlib: extension "GLX" missing on display "0.0".<br />

can’t find visual<br />

Action<br />

First check that the installation has worked correctly by running the package check<br />

utility on the runtime package: # pkgchk SUNWglrt. This should result in an error<br />

message such as:<br />

ERROR: /usr/openwin/server/etc/OWconfig<br />

file size expected actual<br />

file cksum expected actual<br />

208 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


(The numbers might be different but there should be only 1 file.) If other errors<br />

result, re-install OpenGL, especially the SUNWglrt package.<br />

Assuming that is fine, look at the process owner for the Xsun process using:<br />

# ps -aef | grep Xsun | grep -v grep<br />

nobody 20022 225 0 11:36:22 ? 0:34 /usr/openwin/bin/Xsun :0 -nobanner<br />

If the owner is not root, that is most likely the problem since there will be a<br />

permission issue loading the graphic pipelines.<br />

If you are are using CDE, ensure that the Xservers file has the form<br />

:0 Local local_uid@console root /usr/openwin/bin/Xsun :0 -nobanner<br />

The Xservers file will be found in /usr/dt/config if you haven’t done any<br />

customization but more likely /etc/dt/config/. Additional arguments after the<br />

-nobanner option are acceptable.<br />

Another way of proving this is to run OpenWindows from the command line as root.<br />

It will ensure that the Xsun process is owned by root.<br />

Another possibility is that the system is NOT a Creator 3D. You can only run<br />

openGL 1.0 on an Ultra with a Creator 3D graphics card. If you install this<br />

application on an Ultra with a Creator framebuffer and NOT a Creator 3D you’ll see<br />

these same error messages.<br />

xterm: fatal IO error 32 (Broken Pipe)<br />

or KillClient on X server "<strong>string</strong>:0.0"<br />

Cause<br />

This means that xterm(1) has lost its connection to the X server. The "0.0" represents<br />

the display device, which is usually the console. This message can appear when a<br />

user is running xterm and the X server disappears or the client gets shut down.<br />

Data loss is possible if applications were killed before saving files.<br />

Action<br />

Try to run the terminal emulator again in a few minutes after the system has<br />

rebooted and the window system is running.<br />

Alphabetical Message Listing 209


XView warning: Cannot load font set<br />

’<strong>string</strong>’ (Font Package)<br />

Cause<br />

This message from the XView library warns that a requested font is not installed on<br />

the X server. Often multiple warnings are displayed for the same font. The set of<br />

available fonts can vary from release to release.<br />

Action<br />

To see which fonts are available on the X server, run the xlsfonts(1) program. Then<br />

specify another font name that you see in the output of xlsfonts(1). Sometimes it is<br />

possible to locate a similar font from a different vendor.<br />

Technical Notes<br />

There are two packages of X windows fonts: the common but not required fonts<br />

(SUNWxwcft), and the optional fonts (SUNWxwoft). Run pkginfo(1) to see if both<br />

packages are installed, and add them to the system as you want.<br />

"Y"<br />

yp_all RPC clnt_call (transport level)<br />

failure<br />

Cause<br />

At random times, a slave NIS server has a problem that causes ypbind(1M) to<br />

report: ypserver not responding and the machine must be rebooted. The<br />

syslog contains:<br />

Dec 14 07:11:03 rahab syslog: yp_all -<br />

RPC clnt_call (transport level) failure:<br />

RPC: Unable to receive; An event requires attention<br />

210 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


Action<br />

To workaround, increase the file descriptor limit in the yp startup script,<br />

/etc/rc2.d/S71rpc. Add this to the script before ypserv is started:<br />

ulimit -n 256<br />

ypbind[int]: NIS server for domain<br />

"<strong>string</strong>" OK<br />

Cause<br />

This message appears after an "NIS server not responding" message to indicate that<br />

ypbind(1M) is able to communicate with an NIS server again.<br />

Action<br />

Proceed with your work. This message is purely informational.<br />

ypbind[int]: NIS server not responding<br />

for domain " <strong>string</strong> "; still trying<br />

Cause<br />

This means that the NIS client daemon ypbind(1M) cannot communicate with an<br />

NIS server for the specified domain. This message appears when a workstation<br />

running the NIS naming service has become disconnected from the network, or<br />

when NIS servers are down or extremely slow to respond.<br />

Action<br />

If other NIS clients are behaving normally, check the Ethernet cabling on the<br />

workstation that is getting this message. On SPARC machines, disconnected network<br />

cabling also produces a series of "no carrier" messages. On x86 machines, the above<br />

message might be your only indication that network cabling is disconnected.<br />

If many NIS clients on the network are giving this message, go to the NIS server in<br />

question and reboot or repair as necessary. To locate the NIS server for a domain, run<br />

Alphabetical Message Listing 211


the ypwhich(1) command. When the server machine comes back in operation, NIS<br />

clients give an "NIS server for domain OK" message.<br />

See Also<br />

For more information about ypbind(1M), see the section on administering secure<br />

NFS in the NFS Administration Guide.<br />

ypwhich: can’t communicate with ypbind<br />

Cause<br />

This message from the ypwhich(1) command indicates that the NIS binder process<br />

ypbind(1M) is not running on the local machine.<br />

Action<br />

If the system is not configured to use NIS, this message is normal and expected.<br />

Configure the system to use NIS if necessary.<br />

If the system is configured to use NIS, but the ypbind(1M) process is not running,<br />

invoke the following command to start it up:<br />

# /usr/lib/netsvc/yp/ypbind -broadcast<br />

"Z"<br />

zsint: silo overflow<br />

Cause<br />

This message means that the Zilog 8530 character input silo (or serial port FIFO)<br />

overflowed before it could be serviced. The zs(7D) driver, which talks to a Zilog<br />

Z8530 chip, is reporting that the FIFO (holding about two characters) has been<br />

overrun. The number after zs(7D) shows which serial port experienced an overflow:<br />

212 Solaris Common Messages and Troubleshooting Guide ♦ October, 1998


zs0 - tty serial port 0 (/dev/ttya)<br />

zs1 - tty serial port 1 (/dev/ttyb)<br />

zs2 - keyboard port (/dev/kbd)<br />

zs3 - mouse port (/dev/mouse)<br />

Action<br />

Silo overflows indicate that data in the respective serial port FIFO has been lost.<br />

However, consequences of silo overflows might be negligible if the overflows occur<br />

infrequently, if data loss is not catastrophic, or if data can be recovered or reproduced.<br />

For example, although a silo overflow on the mouse driver (zs3) indicates that the<br />

system could not process mouse events quickly enough, the user can perform mouse<br />

motions again. Similarly, lost data from a silo overflow on a serial port with a<br />

modem connection transferring data using uucp(1C) is recovered when uucp(1C)<br />

discovers the loss of data and requests retransmission of the corrupted packet.<br />

Frequent silo overflow messages can indicate a zs(7D) hardware FIFO problem, a<br />

serial driver software problem, or abnormal data or system activity. For example, the<br />

system ignores interrupts during system panics, so mouse and keyboard activity<br />

result in silo overflows.<br />

If the serial ports experiencing silo overflows are not being used, a silo overflow<br />

could indicate the onset of a hardware problem.<br />

Technical Notes<br />

Another type of silo overflow is one that occurs during reboot when an HDLC line is<br />

connected to any of the terminal ports. For example, an X.25 network could be<br />

sending frames before the kernel has been told to expect them. Such overflow<br />

messages can be ignored.<br />

Alphabetical Message Listing 213

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!