RE: Very URGENT REQUEST - Postgresql error : PANIC: could not locate a valid checkpoint record - Mailing list pgsql-bugs
From | Silaparasetti, Ramesh |
---|---|
Subject | RE: Very URGENT REQUEST - Postgresql error : PANIC: could not locate a valid checkpoint record |
Date | |
Msg-id | BYAPR19MB2712FC65381EC6A1B1E3CCAFE02F9@BYAPR19MB2712.namprd19.prod.outlook.com Whole thread Raw |
Responses |
Re: Very URGENT REQUEST - Postgresql error : PANIC: could not locate a valid checkpoint record
|
List | pgsql-bugs |
Hi Team,
Below is the information you suggested to get, kindly let us know your response:
------------------------------------
1. Below is the output of the command : "<DPA_INSTALL_DIRECTORY>\services\datastore\engine\bin\pg_controldata.exe -D "<DPA_INSTALL_DIRECTORY>\services\datastore\data""
Our customer locale is German and below is the output.
C:\Program Files\EMC\DPA\services\datastore\engine\bin>pg_controldata.exe -D "F:\datastore\data\data"
pg_control-Versionsnummer: 1300
Katalogversionsnummer: 202007201
Datenbanksystemidentifikation: 7054941472659574120
Datenbank-Cluster-Status: heruntergefahren
pg_control zuletzt geändert: 07.02.2022 14:57:30
Position des letzten Checkpoints: 9/C80000D8
REDO-Position des letzten Checkpoints: 9/C80000D8
REDO-WAL-Datei des letzten Checkpoints: 0000000100000009000000C8
TimeLineID des letzten Checkpoints: 1
PrevTimeLineID des letzten Checkpoints: 1
full_page_writes des letzten Checkpoints: aus
NextXID des letzten Checkpoints: 0:230890
NextOID des letzten Checkpoints: 38960
NextMultiXactId des letzten Checkpoints: 1
NextMultiOffset des letzten Checkpoints: 0
oldestXID des letzten Checkpoints: 549
DB der oldestXID des letzten Checkpoints: 0
oldestActiveXID des letzten Checkpoints: 0
oldestMultiXid des letzten Checkpoints: 1
DB des oldestMulti des letzten Checkpoints: 0
oldestCommitTsXid des letzten Checkpoints: 0
newestCommitTsXid des letzten Checkpoints: 0
Zeit des letzten Checkpoints: 07.02.2022 14:57:30
Fake-LSN-Zähler für ungeloggte Relationen: 0/3E8
Minimaler Wiederherstellungsendpunkt: 0/0
Zeitleiste des minimalen Wiederherstellungsendpunkts: 0
Backup-Startpunkt: 0/0
Backup-Endpunkt: 0/0
End-of-Backup-Eintrag erforderlich: nein
wal_level-Einstellung: replica
wal_log_hints-Einstellung: aus
max_connections-Einstellung: 250
max_worker_processes-Einstellung: 8
max_wal_senders-Einstellung: 1
max_prepared_xacts-Einstellung: 250
max_locks_per_xact-Einstellung: 64
track_commit_timestamp-Einstellung: aus
Maximale Datenausrichtung (Alignment): 8
Datenbankblockgröße: 8192
Blöcke pro Segment: 131072
WAL-Blockgröße: 8192
Bytes pro WAL-Segment: 16777216
Maximale Bezeichnerlänge: 64
Maximale Spalten in einem Index: 32
Maximale Größe eines Stücks TOAST: 1996
Größe eines Large-Object-Chunks: 2048
Speicherung von Datum/Zeit-Typen: 64-Bit-Ganzzahlen
Übergabe von Float8-Argumenten: Wert
Datenseitenprüfsummenversion: 0
Mock-Authentifizierungs-Nonce: 7b9b893df8e583b7cf43c1647aa0433dc3dabf9eb00bde88bfaf879695cadf78
C:\Program Files\EMC\DPA\services\datastore\engine\bin>
--------------------------------
2. As you suggested, we verified the value of Latest checkpoint's REDO WAL file: 0000000100000009000000C8.
This WAL file does not exist at the pg_wal directory.
We have enabled debug logging and below is the logging information from Postgres.
2022-02-10 11:38:05.675 CET [7916] LOG: starting PostgreSQL 13.1, compiled by Visual C++ build 1900, 64-bit
2022-02-10 11:38:05.679 CET [7916] LOG: listening on IPv4 address "127.0.0.1", port 9003
2022-02-10 11:38:05.681 CET [7916] LOG: listening on IPv4 address "10.91.198.36", port 9003
2022-02-10 11:38:06.756 CET [348] LOG: database system was shut down at 2022-02-07 14:57:30 CET
2022-02-10 11:38:06.756 CET [348] DEBUG: mapped win32 error code 2 to 2
2022-02-10 11:38:06.757 CET [348] DEBUG: mapped win32 error code 2 to 2
2022-02-10 11:38:06.757 CET [348] DEBUG: could not open file "pg_wal/0000000100000009000000C8": No such file or directory
2022-02-10 11:38:06.760 CET [7916] DEBUG: reaping dead processes
2022-02-10 11:38:06.760 CET [7916] LOG: startup process (PID 348) was terminated by exception 0xC0000005
2022-02-10 11:38:06.760 CET [7916] HINT: See C include file "ntstatus.h" for a description of the hexadecimal value.
2022-02-10 11:38:06.760 CET [7916] LOG: aborting startup due to startup process failure
2022-02-10 11:38:06.760 CET [7916] DEBUG: shmem_exit(1): 0 before_shmem_exit callbacks to make
2022-02-10 11:38:06.760 CET [7916] DEBUG: shmem_exit(1): 3 on_shmem_exit callbacks to make
2022-02-10 11:38:06.760 CET [7916] DEBUG: cleaning up dynamic shared memory control segment with ID 1993677498
2022-02-10 11:38:06.774 CET [7916] DEBUG: proc_exit(1): 2 callbacks to make
2022-02-10 11:38:06.774 CET [7916] LOG: database system is shut down
2022-02-10 11:38:06.774 CET [7916] DEBUG: exit(1)
2022-02-10 11:38:06.774 CET [7916] DEBUG: shmem_exit(-1): 0 before_shmem_exit callbacks to make
2022-02-10 11:38:06.774 CET [7916] DEBUG: shmem_exit(-1): 0 on_shmem_exit callbacks to make
2022-02-10 11:38:06.774 CET [7916] DEBUG: proc_exit(-1): 0 callbacks to make
2022-02-10 11:38:06.778 CET [904] DEBUG: logger shutting down
2022-02-10 11:38:06.779 CET [904] DEBUG: shmem_exit(0): 0 before_shmem_exit callbacks to make
2022-02-10 11:38:06.779 CET [904] DEBUG: shmem_exit(0): 0 on_shmem_exit callbacks to make
2022-02-10 11:38:06.779 CET [904] DEBUG: proc_exit(0): 0 callbacks to make
2022-02-10 11:38:06.779 CET [904] DEBUG: exit(0)
-----------------------------------
3. Note that we have not got any Crash dump for Postgres service at below location:
"C:\Users\<USER>\AppData\Local\CrashDumps", where <USER> is the login starting the DB.
-----------------------------------
4. Is it ok to execute “pg_resetwal” to recover from this situation? Does it pose any data loss ?
Best Regards,
Ramesh
-----Original Message-----
From: Julien Rouhaud <rjuju123@gmail.com>
Sent: Thursday, February 3, 2022 9:45 AM
To: Pragati Agarwal
Cc: pgsql-bugs@lists.postgresql.org; Silaparasetti, Ramesh; Kishore, Nanda - Dell Team
Subject: Re: Postgresql error : PANIC: could not locate a valid checkpoint record
[EXTERNAL EMAIL]
Hi,
On Wed, Feb 02, 2022 at 11:29:45PM +0530, Pragati Agarwal wrote:
>
> > The following error "could not locate a valid checkpoint record"
> > occurs on a datastore server running on postgreSQL 13.1.
> > Datastore is hosted on windows 2016 and the system has McAfee virus
> > scan installed.
> > This error occurs when server start is attempted.
> >
> > [7804] LOG: starting PostgreSQL 13.1, compiled by Visual C++ build
> > 1914, 64-bit [8812] LOG: invalid primary checkpoint record [8812]
> > PANIC: could not locate a valid checkpoint record [7804] LOG:
> > startup process (PID 8812) was terminated by exception 0xC0000409
> > [7804] HINT: See C include file "ntstatus.h" for a description of the hexadecimal value.
> > [7804] LOG: aborting startup due to startup process failure [7804]
> > LOG: database system is shut down
Did you experience some hardware failure, or do the previous logs show any other problems?
Can you send the output of
pg_controldata.exe -D $PGDATA
where $PGDATA is the directory where the instance is stored.
The needed command line is likely to be:
"C:\Program Files\Postgresql\13\bin\pg_controldata.exe -D "C:\Program Files\Postgresql\13\data"
Also, there will be a line of the form:
Latest checkpoint's REDO WAL file: XXXXXXXXXXXXXXXXXXXXXXXX
Can you check if the reported files exists in your pg_wal directory (it's a subdirectory of the $PGDATA directory), and if its size is 16MB?
If yes, please check that the permission on the files allows the user starting the postgres service. You could also temporarily disable your antivirus to check if it's blocking access.
pgsql-bugs by date: